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without analyzing the data recorded to the data record- 
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Technical Field 

5 [0001] The present invention relates generally to a readable, writable data recording medium, and relates more 
particularly to a data recording medium for recording moving picture (video) data, still image data, audio data, and 
other types of multimedia data in data broadcasting and various other formats. Our invention also relates to an appa- 
ratus and method for recording data to and playing data back from such a data recording medium. 

10 Background Art 

[0002] Rewritable optical discs have had a maximum storage capacity of approximately 650 MB, but this limit has 
been pushed to several gigabytes by the introduction of DVD-RAM discs, a phase-change type of storage medium. 
Used in conjunction with practical implementations of MPEG (particularly MPEG-2), a digital AV data encoding stand- 
's ard, DVD-RAM is not limited to computer applications and will soon find widespread use as a recording and playback 
medium in the audio-video (AV) and even home entertainment industries. 

[0003] With the start of digital broadcasts in Japan it has become possible to multiplex and simultaneously transmit 
the video, audio, and data portions of plural programs to the MPEG transport stream ("MPEG.TS" below). Digital 
broadcast recorders using hard discs or DVD media to record these programs are also available. 
20 [0004] These next-generation digital broadcast recorders typically record digital broadcasts in the original broadcast 
format without converting the MPEG_TS of the broadcast, and are expected to record AV data from an external line 
input using the MPEG_TS so that the recorder does not need to internally handle both the MPEG program stream 
("MPEG.PS" below) and the MPEG_TS. 

[0005] However, because the current DVD logic standards (including the DVD-Video standard, DVD-Audio standard, 
25 DVD Video Recording standard, and DVD Stream Recording standard) use the MPEG_PS for AV stream recording! 
MPEG_TS to MPEG_PS conversion (TS2PS conversion) is required in order to convert content recorded in the 
MPEG_TS format, such as by the above-noted digital broadcast recorder, to the DVD-Video format, for example. 
[0006] Converting a stream multiplexed to the MPEG_TS to MPEG_PS, however, involves a complex recalculation 
for decoder buffer management, the TS2PS conversion is time-consuming, and often involves re-encoding the ele- 
30 mentary stream, resulting in degraded image quality and sound quality. 

Disclosure of Invention 

[0007] The present invention is directed to solving these problems and an object of the invention is to provide a data 
35 recording medium for recording an MPEG_TS stream enabling fast, simple conversion when converting content re- 
corded in the MPEG_TS format to the MPEG_PS format. A further object is to provide an apparatus and a method for 
recording, converting, and playing back data using the data recording medium of the invention. 

[0008] To achieve these objects a data recording medium according to the present invention records video data and 
audio data encoded in a constraint format enabling conversion from a first stream to a second stream by applying 

*o specific constraints to a specific format of the first stream. 

[0009] The first stream stores data segmented into packets and has a packet structure to which time stamp infor- 
mation indicating a relative transfer timing is added to each packet. In this first stream the transfer rate allowed for 
packets storing video data is greater than or equal to the transfer rate allowed for packets storing audio data. 
[0010] The second stream stores data segmented into packs and has a pack structure to which time stamp informa- 

*s tion indicating a relative transfer timing is added to each pack. In this second stream the transfer rate allowed for packs 
storing video data is equal to the transfer rate allowed for packs storing audio data. 
[0011] The constraint format applies the following constraints. 

[0012] Specifically, a specified number of first stream packets are grouped and managed as a unit, the number of 
first stream packets in the unit determined so that the total size of the specified number of packets managed as a unit 

>o does not exceed the size of one pack in the second stream. In addition, a control packet containing time stamp infor- 
mation indicating absolute transfer timing and time stamp information indicating relative transfer timing is inserted every 
specified number of units. The time stamp information indicating the relative transfer timing of a packet following the 
control packet is set based on the control packet transfer timing. In addition, flag information indicating if video data 
and audio data recorded in the constraint format is recorded to the data recording medium. 

* [0013] The flag information is preferably provided for each video data stream recorded to the data recording medium, 
and is recorded in management information comprising information specific to the video data. 

[0014] A data recording apparatus according to the present invention encodes and records video data and audio 
data in a constraint format enabling conversion from a first stream to a second stream by applying specific constraints 
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to a specific format of the first stream. 

[0015] The data recording apparatus has encoding means for encoding video data and audio data to be recorded; 
a drive means for writing the encoded video data and audio data to the data recording medium; and control means for 
controlling the encoding means and drive means. 

5 [0016] The first stream stores data segmented into packets and has a packet structure to which time stamp infor- 
mation indicating a relative transfer timing is added to each packet. In this first stream the transfer rate allowed for 
packets storing video data is greater than or equal to the transfer rate allowed for packets storing audio data. 
[0017] The second stream stores data segmented into packs and has a pack structure to which time stamp informa- 
tion indicating a relative transfer timing is added to each pack. In this second stream the transfer rate allowed for packs 

10 storing video data is equal to the transfer rate allowed for packs storing audio data. 
[0018] The constraint format applies the following constraints. 

[0019] Specifically, a specified number of first stream packets are grouped and managed as a unit, the number of 
first stream packets in the unit determined so that the total size of the specified number of packets managed as a unit 
does not exceed the size of one pack in the second stream. In addition, a control packet containing time stamp infor- 
ms mation indicating absolute transfer timing and time stamp information indicating relative transfer timing is inserted every 
specified number of units. The time stamp information indicating the relative transfer timing of a packet following the 
control packet is set based on the control packet transfer timing. In addition, flag information indicating if video data 
and audio data recorded in the constraint format is recorded to the data recording medium. 

[0020] The control means controls the encoding means and drive means to encode and record video data and audio 
20 data in the constraint format, and to record flag information indicating if video data and audio data recorded in the 
constraint format is recorded to the data recording medium. 

[0021] A data recording method according to the present invention encodes and records video data and audio data 
in a constraint format enabling conversion from a first stream to a second stream by applying specific constraints to a 
specific format of the first stream. 

25 [0022] The first stream stores data segmented into packets and has a packet structure to which time stamp infor- 
mation indicating a relative transfer timing is added to each packet. In this first stream the transfer rate allowed for 
packets storing video data is greater than or equal to the transfer rate allowed for packets storing audio data. 
[0023] The second stream stores data segmented into packs and has a pack structure to which time stamp informa- 
tion indicating a relative transfer timing is added to each pack. In this second stream the transfer rate allowed for packs 

30 storing video data is equal to the transfer rate allowed for packs storing audio data. 

[0024] The data recording method has a step for grouping and managing as a unit a specific number of first stream 
packets; a step for inserting a control packet containing time stamp information indicating absolute transfer timing and 
time stamp information indicating relative transfer timing every specified number of units; and a step for recording flag 
information indicating if video data and audio data recorded in the constraint format is recorded to the data recording 

35 medium. 

[0025] The number of first stream packets in the unit is determined so that the total size of the specified number of 
packets managed as a unit does not exceed a size of one pack in the second stream. The time stamp information 
indicating the relative transfer timing of a packet following the control packet is set based on the control packet transfer 
timing. 

40 [0026] Other objects and attainments together with a fuller understanding of the invention will become apparent and 
appreciated by referring to the following description and claims taken in conjunction with the accompanying drawings. 

Brief Description of Drawings 

45 [0027] 

Fig. 1 is a schematic diagram showing a DVD recording apparatus and an exemplary interface between the DVD 
recording apparatus and other components used in conjunction therewith. 
Fig. 2 is a block diagram of the drive apparatus of a DVD recorder. 
50 Fig. 3A illustrates a contiguous area on the disc, and Fig. 3B is a graph illustrating the data accumulation in a track 

buffer. 

Fig. 4 is a block diagram of a DVD recorder having a semiconductor memory card and hard disk drive. 
Figs. 5A and 5B show a disc and physical structure of the disc. 
Figs. 6A and 6B show the logical data space of the disc. 
55 Fig. 7 shows the disc directory and file structure. 

Fig. 8 shows the structure of a video object. 
Fig. 9 shows the MPEG system stream. 

Figs. 10A to 10C show the MPEG transport stream (MPEG_TS). 
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Figs. 11 A to 11 C show the MPEG program stream (MPEG_PS). 
Figs. 12A to 12D show a TS packet. 
Figs. 13A to 13C2 shows a PAT table. 

Figs. 14A to 14C show the arrangement of video objects on disc. 
s Figs. 15A and 15B show the data structure of video management information. 

Figs. 16A and 16B show the data structure of video management information. 

Fig. 17 shows the relationship between an object, object information, and PGC information in the video manage- 
ment information. 

Fig. 18 is a block diagram showing the functional configuration of a playback apparatus. 
io Fig. 19 is a block diagram showing the functional configuration of a recording apparatus. 

Fig. 20 is a block diagram showing the configuration of a data recording and reproducing apparatus according to 
the present invention. 

Fig. 21 shows the structure of a self -encoding stream. 
Figs. 22A and 22B describes the packet transfer time interval. 
15 Fig. 23 describes a storage method for a User Private packet. 

Fig. 24 describes a storage method for a User Private packet. 
Fig. 25 describes a storage method for a User Private packet. 
Fig. 26 describes a storage method for a User Private packet. 

Figs. 27A to 27H described conversion of an MPEG_TS to an MPEG_PS. It should be noted that, in this figure, 
20 Capsule Header and ATS are omitted because those have little relativity to the present invention, and that each 

pack in converted MPEG-PS shown in Figs. 27G and 27H is stuffed or padded according to byte length or VOBU 
alignment of stored elementary. 

Figs. 28A to 28G show an encoding method for an MPEG_TS enabling easy conversion to an MPEG_PS. 
Fig. 29 shows conversion to a DVD Video format (NTSC). 
25 Fig. 30 shows conversion to a DVD Video format (PAL). 

Fig. 31 shows the internal data structure of a User Private packet. It should be noted that Ceil() means to round 
out the decimal place. 

Fig. 32 shows the correlation between an MPEG_TS encoded for easy conversion to an MPEG_PS and the 
MPEG_PS after conversion. 

30 Fig. 33 is a block diagram of the encoder of a data recording apparatus according to the present invention. 

Fig. 34 shows differences in processes for converting from a self-encoded MPEG_TS to DVD formats due to 

differences in system encoding. 

Fig. 35 shows the Tip packet data structure. 

Fig. 36 shows the adaptation field data structure. 
35 Fig. 37 shows the Data_ID data structure. 

Fig. 38 shows the display_and_copy_info data structure. 

Fig. 39 shows the encodejnfo data structure. 

Fig. 40 shows the PESJnfo data structure. 

Fig. 41 shows the Makers Private Data data structure. 
40 Fig. 42A shows PID of the Tip packet. 

Fig. 42 B shows the stream_type. 

Fig. 43 shows field values of the PES packet header in a Constrained SESF stream. 

Fig. 44 shows the PES_extension_flag and PES_header_dataJength in a Constrained SESF stream. It should 

be noted that in the figure, VPD is 0 when PTS_DTS_flag = 00b, VPD is 5 when PTS_DTS_flag = 01b, and VPD 
^5 is 1 0 when PTS_DTS_f lag = 11b. 

Fig. 45 shows an example of an MPEG_TS self -encoded such that it does not satisfy T_STD model. 

Figs. 46A and 46B show an example of an MPEG_PS converted from a MPEG_TS such that the MPEG_PS does 

not satisfy the P_STD model. 

Fig. 47 shows SCR calculation. 
50 Fig. 48 shows the elementary stream attributes of a Constrained SESF when encode_condition = 11b. 

Fig. 49 shows the elementary stream attributes of a Constrained SESF when encode_condition = 01b. 

Fig. 50 shows the stream structure of a format conforming to the DVD Video standard. 

Fig. 51 shows the structure of PCI data in NV_PCK. 

Fig. 52 shows the structure of PCI_GI data in NV_PCK. 
55 Fig. 53 shows the structure of DSl data in NV_PCK. 

Fig. 54 shows the structure of DSI_GI data in NV_PCK. 

Fig. 55 shows the structure of SML_PBI data in NV_PCK. 

Fig. 56 shows the structure of SYNCI data in NV_PCK. 



4 



BNSDCCID: <EP. 



.142271 OA 1_l_> 



EP1 422 710 A1 



Fig. 57 shows the stream structure of a format conforming to the DVD Video Recording standard. 

Fig. 58 is a flow chart of the TS packet (RD_PCK) conversion process. 

Fig. 59 is a flow chart of the TS packet (V_PCK, A_PCK) conversion process. 

Fig. 60 shows a part of the data structure of the pack header in an MPEG-2 program stream pack. 

Fig. 61 shows a DVD format system header. 

Fig. 62A shows the structure of a packet header stored in RDI_PCK. 
Fig. 62B shows the structure of a packet header stored in RDI_PCK. 

Fig. 63 shows a part of the data structure of the packet header in an MPEG-2 program stream packet. 

Fig. 64 shows the structure of an AC-3 standard private header in the DVD format. 

Figs. 65A and 65B show converting a Constrained SESF to an MPEG_PS for a video pack. 

Figs. 66A and 66B show converting a Constrained SESF to an MPEG_PS for an audio pack. 

Fig. 67 is a table of audio bit rates allowed by the Constrained SESF, and the maximum payload length stored to 

one audio PES packet for AC-3 and MPEG-1 Audio at the corresponding bit rates. 

Fig. 68 is a flow chart of overall TS2PS conversion process. 

Fig. 69 is a flow chart of initialization process in the TS2PS conversion process. 

Fig. 70 is a flow chart of the capsule unit process in the TS2PS conversion process. 

Fig. 71 is a flow chart of the pack unit process. 

Fig. 72 is a flow chart of the SCR calculation process. 

Fig. 73 is a flow chart of the pack header process. 

Fig. 74 is a flow chart of the packet header process. 

Fig. 75 is a flow chart of the stream ID process. 

Fig. 76 is a flow chart of the PES packet leading process. 

Fig. 77 is a flow chart of the non-PES packet leading process. 

Fig. 78 is a flow chart of the payload process. 

Fig. 79 is a flow chart of the padding packet process. 

Fig. 80 shows the Constrained SESF stream format. 

Fig. 81 shows the data structure of an MPEG standard PES packet. 

Fig. 82 shows a method of generating NV_PCK data. 



Best Mode for Carrying Out the Invention 



[0028] A DVD disc, DVD recorder, and DVD player are described with reference to the accompanying figures in the 
sequence shown below as preferred embodiments of a data recording medium, recording apparatus, and playback 
apparatus according to the present invention. 

[0029] Key points of the present invention are described particularly in the following section 8, outline of the invention, 
and section 9, detailed embodiments of the invention. While the relationship to the present invention may vary, all of 
the following describe various aspects of the invention. 



1 . Outline of the DVD recorder system 

2. Function outline of the DVD recorder 

3. Outline of the DVD disc 

4. Outline of reproduced AV data 

5. AV data management information and playback control 

6. Basic operation of the playback function 

7. Basic operation of the recording function 

8. Outline of the invention 

9. Detailed embodiments of the invention 



[0030] The following terminology is used below. 

[0031] "TS2PS conversion" refers to converting the MPEG transport stream (MPEG_TS) to the MPEG program 
stream (MPEG_PS). 

[0032] "DVD format" refers to both the DVD-Video standard format and the DVD-Video Recording standard format, 
each being an MPEG_PS implementation. 



1 . Outline of the DVD recorder system 



[0033] Fig. 1 shows a typical DVD recorder in relation to other systems and devices used with the DVD recorder. 
[0034] As shown in Fig. 1 a DVD, which is a type of optical disc, is loaded to the DVD recorder for recording video 
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data to the disc and reproducing video data from the disc. A remote control device is typically used to operate the DVD 
recorder. 

[0035] The video data input to the DVD recorder could be an analog signal or a digital signal with analog broadcasts 
exemplary of analog signals and digital broadcasts exemplary of digital signals. Generally speaking, analog broadcasts 
5 are received and demodulated by the receiver built in to a television, and input as an NTSC or other analog video 
signal to the DVD recorder for recording. Digital broadcasts are demodulated to a digital signal by the digital broadcast 
receiver (set-top box (STB)) input to the DVD recorder for recording. 

[0036] Video data recorded to a DVD is reproduced by the DVD recorder and externally output. Like the video input, 
video output may be an analog signal or digital signal. Analog signals are input directly to the television. Digital signals 
10 are passed through the STB and converted to an analog signal, which is then input to the television for video presen- 
tation . 

[0037] Video data may also be recorded to and reproduced from a DVD by a device other than a DVD recorder, such 
as a DVD camcorder or personal computer. A DVD disc storing video data recorded by a device other than a DVD 
recorder will also be reproduced by the DVD recorder when loaded therein. 
15 [0038] It should be noted that audio data is normally associated with the video data of an analog broadcast or digital 
broadcast, and this audio data is likewise recorded and reproduced by the DVD recorder. 

[0039] Furthermore, the video data is generally moving picture data, but could also include still images such as when 
a still image (photograph) is captured using the snapshot function of a DVD camcorder. 

[0040] IEEE 1394, ATAPI, SCSI, or other standard could be used for the digital interface between the STB and DVD 
20 recorder. 

[0041 ] It should also be noted that an NTSC signal is referred to above as the type of component video signal passed 
between the DVD recorder and 'television, but a component signal sending separate luminance and color difference 
signals could be used. Furthermore, changing the interface for transmitting video between AV components and tele- 
visions from an analog interface to a digital interface such as DVI is currently being researched, and we anticipate that 
a digital interface can also be used to connect DVD recorders and televisions. 

2. Function outline of the DVD recorder 

[0042] Fig. 2 is a function block diagram of a DVD recorder. The drive device has an optical pickup 1 01 for reading 
data from a DVD-RAM disc 1 00, an ECC (error correction code) processor 1 02, track buffer 1 03, switch 1 04 for changing 
track buffer 1 03 input and output, an encoder 1 05, and a decoder 1 06. 

[0043] As shown in the figure data is recorded to the DVD-RAM disc 1 00 with the smallest recording unit being one 
sector (= 2 KB). Furthermore, 32 sectors equal 1 ECC block, and the ECC processor 102 applies error correction 
processing using ECC block units. 

[0044] The DVD recorder could also use semiconductor memory cards or hard disk drives in addition to DVDs as 
data storage media. Fig. 4 is a block diagram of a DVD recorder having a semiconductor memory card and hard disk 
drive. 

[0045] It should also be noted that 1 sector could be 51 2 bytes, 8 KB, or other size. The ECC block could also contain 
1 sector, 16 sectors, 32 sectors, or other configuration. It is expected that the sector size and number of sectors in 
each ECC block will also increase as the recordable data capacity increases. 

[0046] The track buffer 103 is a buffer for recording AV data at a variable bit rate (VBR) in order to record AV data 
more efficiently to the DVD-RAM disc 100. The DVD-RAM disc 100 write rate (Va) is a fixed rate but the bit rate (Vb) 
of the AV data varies according to the complexity of the AV content (images in the case of video content). The track 
buffer 1 03 is used to absorb this bit rate difference. 

[0047] In order to use this track buffer 1 03 even more effectively, the AV data can be distributively recorded to the 
disc 100. This is further described with reference to Figs. 3A and 3B. 

[0048] Fig. 3A shows the disc address space. As shown in Fig. 3A, continuous playback of the AV data is enabled 
when the AV data is recorded to separate contiguous spaces [a1 , a2] and [a3, a4] by supplying data accumulated in 
the track buffer to the decoder 1 06 while seeking from a2 to a3. The change in the amount of data stored to the track 
buffer at this time is shown in Fig. 3B. 

[0049] When reading starts at address a1 , the AV data is input from time t1 to the track buffer 103 and data output 
from the track buffer 103 also starts. Data then accumulates in the track buffer 103 at the rate (Va-Vb), that is, the 
difference between the input rate (Va) to the track buffer 1 03 and the track buffer output rate (Vb). This continues until 
the search area reaches a2, that is, until time t2. If the data accumulated in the track buffer 103 during this time is B 
(t2), data can be supplied to the decoder 1 06 by gradually depleting the data B(t2) accumulated in the track buffer 1 03 
from time t2 to the time t3 at which reading from the address a3 begins. 

[0050] In other words, a continuous supply of AV data can be maintained during seek operations insofar as at least 
a specified amount of data ([a1 , a2]) has been read before the seek operation starts. 
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[0051] The size of the contiguous area required to enable continuous AV data output when converted to an ECC 
block count (N_ecc) is shown by the following equation: 

5 N_ecc = Vb*Tj/((N_sec*8*S_size)*(1 -Vb/Va)) 

where N_sec is the number of sectors in an ECC block, S_size is the sector size, and Tj is the seek performance 
(maximum seek time). 

[0052] A defective sector could also occur in a contiguous area. The required size of the contiguous area in this case 
10 is shown by the following equation: 

N_ecc = dN_ecc+Vb*Tj/((N„sec*8*S_size)*(1-VbA/a)) 

15 where dN_ecc is the size of the allowed defective sector, and Ts is the time needed to skip the defective sector within 
the contiguous area. This equation also returns the size of the contiguous area as the number of ECC blocks. 
[0053] The above example is described using reading data from a DVD-RAM disc, that is, data playback, by way of 
example, but it will be obvious that writing, that is, recording, data to the DVD- RAM disc can be handled in the same way. 
[0054] Continuous data playback and recording can thus be achieved with a DVD-RAM disc even when the AV data 

20 is recorded to separate recording areas on the disc insofar as the data is recorded in blocks of a specific size or more. 
These contiguous areas are referred to as Contiguous Data Areas (CDA) in DVD terminology. 

3. Outline of the DVD disc 

25 [0055] Figs. 5A and 5B show the physical structure and a plan view of a DVD-RAM, i.e., a recordable optical disc. 
DVD-RAM discs are typically housed in a cartridge for loading to a DVD recorder. The purpose of the cartridge is to 
protect the disc. The DVD-RAM disc can, however, be loaded directly to the DVD recorder without being housed in a 
cartridge if the recording surface can be protected in some other way. 

[0056] DVD-RAM discs are recorded using a phase-change recording technique. Data on the disc is managed by 
30 sector unit, and addresses are added for data access. Groups of 32 sectors are used for error correction, 'have an 
error correction code added thereto, and are referred to as ECC blocks. 

[0057] Fig. 5A shows the recording area of a DVD-RAM disc, i.e., a recordable optical disc. As shown in the figure, 
a DVD-RAM disc has a lead-in area at the inside circumference, a lead-out area at the outside circumference, and a 
data area between the lead-in and lead-out areas. 
35 [0058] Reference signals for stabilizing the servo when accessing the disc with the optical pickup, and an ID signal 
for distinguishing a DVD-RAM disc from other types of media, are recorded to the lead-in area. 
[0059] The same reference signals are also recorded to the lead-out area. 

[0060] The data area is segmented into sectors (each 2048 bytes) as the smallest access unit. The data area is also 
segmented into a plurality of zones in order to apply a rotational control technique known as Zone Constant Linear 

40 Velocity (Z-CLV) during recording and playback. 

[0061] Fig. 5A shows plural zones formed concentrically on the DVD-RAM disc. In this example the DVD-RAM disc 
is divided into 24 zones, labelled zone 0 to zone 23. The rotational angular velocity of the DVD-RAM is set differently 
in each zone such that it increases in proximity to the inside circumference and is constant while the optica! pickup 
accesses data in the same zone. This increases the recording density of the DVD-RAM and enables easier rotational 

45 control during recording and playback. 

[0062] Fig. 5B shows the lead-in area, lead-out area : and zones 0 to 23 concentrically arranged in Fig. 5A when 
viewed in a line through the disc radius. 

[0063] The lead-in area and lead-out area each include a defect management area (DMA). The defect management 
area, is for recording position information indicating the location of a sector containing a defect, and substitute sector 

50 position information indicating in which substitute area the sector substituted for the defective sector is located. 

[0064] Each zone includes a user area between a substitute area and an unused area. The user area is the area 
that can be used by the file system as a recording area. The substitute area is the area substitutional^ used when 
there is a defective sector. The unused area is an area not used for data recording, and is approximately two tracks 
wide. The sector address is recorded to the same position in adjacent tracks within each zone, but with Z-CLV the 

55 sector address is recorded to a different position in tracks adjacent to the zone boundary. This unused area is therefore 
provided to prevent sector address detection errors in tracks adjacent to the zone boundary. 

[0065] There are, therefore, sectors not used for data recording at the zone boundaries. A logical sector number 
(LSN) is therefore assigned to each physical sector in the user area of a DVD-RAM disc sequentially from the inside 
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circumference in order to continuously identify only those sectors used for data recording. 

[0066] Fig. 6 shows the logical data space of a DVD- RAM disc comprising logical sectors. The logical data space is 
called the "volume space" and is used to record user data. 

[0067] Data recorded in the volume space is managed with a file system. More specifically, a group of sectors storing 
5 data is a "file," and volume structure information managing a group of files as a "directory" is recorded to the beginning 
and end of the volume area. The UDF file system is used in the present embodiment and conforms to ISO 13346. 
[0068] The above-noted group of sectors is not necessarily contiguous within the volume space, and can be split 
into separate parts. Of the sectors constituting each file, the file system therefore manages each group of contiguous 
sectors in the volume space as an extent, and manages each file as a set of related extents. 
w [0069] Fig. 7 shows the structure of a directory and file recorded to DVD-RAM. Below the root is the VIDEO_RT 
directory, and below VIDEO_RT are the various object files containing the playback data and a VIDEO Manager file 
containing management information such as the playback sequence and various attributes. 

[0070] Objects are data structures conforming to MPEG standards, and include PS_VOB, TS1_VOB, TS2_VOB, 
AOB, POB, and MNF (Manufacturer's Private Data). 
15 [0071] PS_VOB, AOB, and POB are MPEG program streams (PS), and TS1_VOB and TS2_VOB are MPEG trans- 
port streams (TS). The program stream has a data structure designed for storing AV data to package media. The 
transport stream has a data structure intended for communications media. 

[0072] PS_VOB, TS1_VOB and TS2_VOB are objects of primarily video data but containing both video data and 
audio data. In principle, TS1_VOB objects are encoded by the DVD recorder with an explicitly managed internal picture 
20 structure. TS2_VOB objects are encoded externally to the DVD recorder, and part of the internal picture structure and 
data structure is unknown. 

[0073] Typically, TS1_VOB objects are externally input analog video signals encoded by the DVD recorder to the 
transport stream, and TS2_VOB objects are externally input digital video signal objects recorded directly to disc without 
further encoding by the DVD recorder. 
25 [0074] AOB and POB are MPEG program streams. AOB objects contain primarily audio data, and POB objects 
contain primarily still images. 

[0075] The MNF (Manufacturer's Private Data) block is used to store information specific to a particular manufacturer. 
[0076] "Primarily video data" and "primarily audio data" above indicate that a high bit rate is allocated. VOB are used 
in video and similar applications, and AOB are used in music applications. 

30 

4. Outline of reproduced AV data 

[0077] Fig. 8 shows the structure of MPEG data recorded as AV objects to a DVD. 

[0078] As shown in Fig. 8, the video stream and audio stream are segmented and multiplexed. The MPEG standard 
35 refers to the multiplexed streams as the system stream. In the case of DVD, a system stream containing DVD specific 
settings is called a VOB (Video OBject). The segmentation units are called packs and packets, and are approximately 
2 KB in size. 

[0079] The video stream is encoded according to the MPEG standard, variable bit rate compressed such that the 
bit rate is increased in complex images such as images containing much movement. The pictures in an MPEG stream 
40 are encoded as l-pictures, P-pictures, or B-pictures. l-pictures are spatially compressed and complete within each 
frame. P-pictures and B-pictures are temporally compressed using inter-frame correlations. A series of pictures includ- 
ing at least one l-picture is referred to as a Group of Pictures (GOP) in MPEG. A GOP is the access point for fast play 
and other special play modes, which are made possible by the presence of at least one intra-frame compressed I- 
picture. 

45 [0080] In addition to using MPEG audio, the audio stream of a DVD can be encoded using AC-3, LPCM, or other 
encoding technique. 

[0081] As also shown in Fig. 8 the Video Object Unit (VOBU) is the data unit multiplexing the video data of a GOP 
with the associated audio data. Video management data can also be included in a VOBU as header information. 
[0082] A program stream (PS) and transport stream (TS) are included in the system stream described with reference 
50 to Fig. 8. As noted above, the program stream has a data structure intended for package media and the transport 
stream data structure is intended for communications media. 

[0083] Fig. 9 shows the concept of the program stream and transport stream data structures. 
[0084] The program stream comprises fixed length packs that are the smallest unit for data transfer and multiplexing. 
Each pack contains one or more packets. Both packs and packets comprise a header part and a data part. The data 
55 part is referred to as the payload in MPEG. For compatibility with the sector size, the fixed length of a pack in DVD is 
2 KB. A pack can contain multiple packets, but because packs storing DVD video and audio contain only one packet, 
1 pack equals 1 packet except in special cases. 

[0085] The data transfer and multiplexing unit of the transport stream comprises fixed length TS packets. TS packet 
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size is 1 88 bytes for compatibility with ATM transmissions, a communications standard. One or more TS packets form 
a PES packet. 

[0086] PES packets are a concept common to both the program stream and transport stream, and the data structure 
is the same. Packets stored in program stream packs directly form PES packets, and a group of one or more transport 

5 stream TS packets form a PES packet. 

[0087] The PES packet is the smallest encoding unit and stores video data and audio data with common encoding. 
More specifically, video data and audio data encoded with different coding methods are not present in a same PES 
packet. However, if the coding method is the same, it is not necessary to ensure the picture boundaries and audio 
frame boundaries. As shown in Fig. 9 one frame is stored to plural PES packets, and plural frames may be stored to 

io one PES packet. 

[0088] Figs. 1 0A to 1 0C and Figs. 1 1 A to 1 1 C show the data structures of the transport stream and program stream. 
[0089] As shown in Figs. 10Ato 10Cand Figs. 12Ato 12D : each TS packet comprises a TS packet header, adaptation 
field, and payload. The TS packet header stores a Packet Identifier (PID) whereby the video, audio, or other stream 
to which the TS packet belongs can be identified. 
15 [0090] The Program Clock Reference (PGR) is stored to the adaptation field. The PCR is the reference value for the 
system time clock (STC) of the device decoding the stream. The device typically demultiplexes the system stream 
based on the PCR timing, and then reassembles the video stream and other streams. 

[0091] The Decoding Time Stamp (DTS) and Presentation Time Stamp (PTS) are stored to the PES header. The 
DTS indicates the decoding timing of the picture or audio frame stored to the PES packet, and the PTS indicates the 
20 presentation timing of the video or audio output. 

[0092] It should be noted that the PTS and DTS need not be written to every PES packet header. Decoding and 
output are possible insofar as the PTS and DTS are written to the header of the PES packet where the first data of the 
l-picture is stored. 

[0093] The TS packet structure is shown in detail in Figs. 12A to 12D. 
25 [0094] As shown in Figs. 12A to 12D the adaptation field stores the PCR and a random access presentation flag. 
This flag indicates whether data that is at the beginning of the video or audio frame and can be used as an access 
point is stored in the corresponding payload. In addition to the above-noted PID, the TS packet header also stores a 
unit start presentation flag indicating the beginning of a PES packet, and adaptation field control data indicating whether 
an adaptation field follows. 

30 [0095] Figs. 11 A to 11 C show the structure of packs in the program stream. The pack contains the SCR in the pack 
header and a streamjd in the packet header of packets stored in the pack. The SCR is effectively identical to the 
transport stream PCR, and the streamjd to the PID. The PES packet data structure is also the same as in the transport 
stream, and the PTS and DTS are stored in the PES header. 

[0096] One major difference between the program stream and transport stream is that the transport stream allows 
35 for multiple programs. That is, in terms of program units, the program stream can carry only one program but the 

transport stream can simultaneously transmit multiple programs. This means that the playback device must be able 

to identify the video streams and audio streams constituting each program carried in the transport stream. 

[0097] Figs. 13A to 13C2 shows the PAT table and PMAP table used to transmit structure information for the audio 

stream and video stream of each program. As shown in Figs. 13C1 and 13C2 the EMAP table stores information 
*o relating to the combination of video and audio streams used in each program, and the PAT table stores information 

correlating programs and PMAP tables. The playback device can therefore reference the PAT table and PMAP table 

to detect the video and audio streams for the program to be output. 

[0098] How the program stream packs and transport stream TS packets described above are arranged on the disc 

is described next with reference to Figs. 14A to 14C. 
45 [0099] As shown in Fig. 14A there are 32 sectors in an ECC block. 

[0100] As shown in Fig. 14B, the packs (PS Packs) forming a video object (PS_VOB) of a program stream type are 

located at the sector boundaries. This is because the pack size and sector size are both 2 KB. 

[0101] Video objects (TS1_VOB, TS2_VOB) of the transport stream type, however, are 8 KB units and are therefore 

contained in the ECC block. Each 8 KB unit contains an 18 byte header area and 43 TS packets containing Arrival 
50 Time Stamp (ATS) information in the data area. The ATS information is data generated and added by the DVD recorder, 

and indicates the timing at which the packet was received by the DVD recorder from an external source. 

[0102] It should be noted that an MPEG_TS storage format continuously recording combinations of fixed-byte length 

ATS and MPEG_TS packets is also possible as shown in Fig. 14C. 

55 5. AV data management information and playback control 

[0103] Figs. 15A to 15B and Figs. 16A to 16B show the data structure of the video management information file 
(Video Manager) shown in Fig. 7. 
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[0104] The video management information includes object information describing such management information as 
where objects are recorded on disc, and presentation control information describing the playback sequence of the 
objects. 

[0105] Fig. 15A shows an example in which the objects recorded to the disc include PS_VOB#1 - PS_VOB#n 
TS1_VOB#1 - TS1_VOB#n, and TS2_VOB#1 - TS2_VOB#n. 

[0106] As shown in Fig. 15A a PS^VOB information table, TS1_VOB information table, and TS2_VOB information 
table are separately recorded according to the object types. Each of these tables stores VOB information for each object. 
[0107] The VOB information includes general information about the corresponding object, object attribute data, an 
access map for converting the object playback time to a disc address value, and management information for the 
access map. The general information includes identification information for the corresponding object and object re- 
cording time. The attributes include video stream attributes (V_ATR) such as the video stream coding mode, the number 
of audio streams (AST_Ns), and audio stream attributes (A_ATR) such as the audio stream coding mode. 
[0108] There are two reasons why an access map is required. The first is so that the playback path information avoids 
directly referencing object recording positions based on a sector address value, for example, and instead can indirectly 
reference object locations based on the object playback time. Object recording positions can change with RAM media 
as a result of editing the object, for example. This increases the amount of playback path information that must be 
updated if the playback path information references object recording positions directly based on the sector address 
If the objects are referenced indirectly based on the playback time, however, it is not necessary to update the playback 
path information and only the access map needs to be updated. 

[0109] The second reason is that the audio stream typically has two reference bases, the time base and data (bit 
stream) base, but the correlation therebetween is not complete. 

[0110] For example, using a variable bit rate (a method of changing the bit rate according to the complexity of the 
image) is becoming the norm with MPEG-2 Video, an international standard for video stream encoding. In this case 
there is no proportional relationship between the amount of data from the stream start and playback time, and random 
access based on the time base is therefore not possible. An access map is used to resolve this problem by converting 
between the time base and data (bit stream) base. 

[0111] As shown in Fig. 15A, the presentation control information includes a user-defined playback path information 
table, original playback path information table, and title search pointer 

[0112] As shown in Fig. 16A there are two types of playback paths data: originally defined playback path information 
generated automatically by the DVD recorder to describe all objects recorded during object recording, and user-defined 
playback path information enabling a user to freely define a particular playback sequence. The playback path informa- 
tion is uniformly referred to as Program Chain Information (PGC information) on a DVD : the user-defined playback 
path information is referred to as the U_PGC information, and the original playback path information as the 0_PGC 
information. The U^PGC information and 0_PGC information are tables listing the cell information describing the cells 
in the object playback period. The object playback period indicated by the 0_PGC information is called an original cell 
(0_CELL), and the object playback period indicated by the U_PGC information is called a user cell (U_CELL). 
[0113] A cell indicates the object playback period using the object playback start time and playback end time; the 
playback start and end times are converting by the access map described above to the actual location where the object 
is recorded on disc. 

[0114] As shown in Fig. 16B, a cell group indicated by the PGC information defines a continuous playback sequence 
reproduced sequentially according to the order of entries in the table. 

[0115] Fig. 17 shows a specific relationship between objects, cells, PGC, and access map. 

[0116] As shown in Fig. 17 the original PGC information 50 contains at least one cell information entry 60, 61 , 62, 63. 
[0117] Each cell information entry defines the object to reproduce as well as the object type, and object playback 
period. The order of the cell information entries in the PGC information 50 defines the playback sequence of the objects 
defined by each cell when the objects are reproduced. 

[01 1 8] Each cell information entry (cell information 60, for example) includes a Type 60a indicating the type of specific 
object, an Object ID 60b identifying a particular object, and a start presentation time Start_PTM 60c and end presen- 
tation time End_PTM 60d in the object on the time base. 

[0119] During data playback the cell information 60 is sequentially read from the PGC information 50, and the objects 
specified by each cell are reproduced for the playback period defined by the cell. 

[0120] The access map 80c converts the start and end time information contained in the cell information to the object 
address on disc. 

[0121] This access map is the map information described above and is generated and recorded when the objects 
are recorded. The picture structure of the object data must be analyzed in order to generate the map. More specifically 
it is necessary to detect the l-picture location shown in Fig. 9, and detect the PTS and other time stamp information', 
that is, the l-picture playback time shown in Figs. 10A to 10C and Figs. 11 A to 11 C. 

[0122] Problems occurring when generating the PS.VOB, TS1_VOB, and TS2_VOB map information are described 
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next. 

[0123] As described with reference to Fig. 1 , the PS.VOB andTS1_VOB are primarily generated by the DVD recorder 
encoding a received analog broadcast to an MPEG stream. The l-picture and time stamp information is therefore auto- 
generated by the DVD recorder, the internal data structure of the stream is known to the DVD recorder, and the map 

5 Information can be generated with no problem. 

[0124] As also described with reference to Fig. 1 , the TS2_VOB is a received digital broadcast recorded directly to 
disc by the DVD recorder with no intermediate encoding. Because the recorder thus does not generate the time stamp 
information and determine the l-picture locations as it does when recording a PS_VOB, the DVD recorder does not 
know the internal data structure of the stream and must therefore detect this information from the recorded digital 

10 stream. 

[0125] To do this the DVD recorder detects the l-picture and time stamp information for the map information of a 
TS2_VOB recording a stream encoded externally to the recorder as follows. 

[0126] First, l-pictures are detected by detecting the random access presentation information of the TS packet ad- 
aptation field shown in Figs. 12A to 12D. The time stamp information is detected by detecting the PTS in the PES 
15 header. Note that the PCR from the adaptation field or the ATS indicating the TS packet arrival time at the DVD recorder 
can be used instead of the PTS for the time stamp. In any case, the DVD recorder detects l-picture locations based 
on information in a high level system layer and does not need to analyze the data structure of the MPEG stream video 
layer. This is because the system overhead required to analyze the video layer in order to generate the map information 
is great. 

20 [01 27] There are also cases in which system layer detection is not possible. The map information cannot be generated 
in such cases and it is therefore necessary to indicate that there is no valid map information. The DVD recorder indicates 
this using the map management information shown in Fig. 15B. 

[0128] The map management information shown in Fig. 15B contains map validity information and a self-encoding 
flag. The self-encoding flag indicates that an object was encoded by the DVD recorder, and thus indicates that the 

25 internal picture structure is known and that the map information time stamp information and l-picture location information 
is accurate. The map validity information indicates whether or not there is a valid access map. 
[01 29] Examples of when the system layer cannot be detected include when the adaptation field is not set and when 
the digital stream is not an M PEG transport stream. Various digital broadcasting standards and formats are used around 
the world, and there will naturally be cases in which the DVD recorder records objects for which it cannot generate a 

30 map. For example, if a DVD recorder designed for the Japanese market and recording digital broadcasts in Japan is 
used in the United States to record digital broadcasts in the United States, there will likely be cases in which the DVD 
recorder cannot generate a map for the recorded objects. 

[01 30] The DVD recorder can, however, sequentially reproduce from the beginning objects for which map information 
is not generated. In this case video from the recorded digital stream can be reproduced by outputting it through a digital 
35 interface to a STB appropriate to the stream. 

6. Basic operation of the playback function 

[0131] The playback operation of a DVD recorder/player for reproducing content recorded to an optical disc as de- 

40 scribed above is described next below with reference to Fig. 18. 

[0132] As shown in Fig. 18 the DVD player has an optical pickup 201 for reading data from the optical disc 100, an 
ECC processor 202 for error correction processing of the read data, a track buffer 203 for temporarily storing the read 
data after error correction, a PS decoder 205 for reproducing video objects (PS_VOB) and other program streams, a 
TS decoder 206 for reproducing digital broadcast objects (TS2_VOB) and other transport streams, an audio decoder 

45 207 for reproducing audio objects (AOB) ( a still picture decoder 208 for decoding still picture objects (POB), a switching 
means 210 for changing data input to the decoders 205 to 208, and a controller 211 for controlling the various parts 
of the player. 

[0133] Data recorded to the optical disc 100 is read by the optical pickup 201 , passed through the ECC processor 
202 and stored to track buffer 203. Data stored to the track buffer 203 is then input to and decoded and output by the 

so ps decoder 205, TS decoder 206, audio decoder 207, or still picture decoder 208. 

[0134] The controller 211 determines what data to be read based on the playback sequence defined by the playback 
path information (PGC) shown in Figs. 16A and 16B. Using the example shown in Figs. 16A and 16B, the controller 
211 thus first reproduces part (CELL #1) of VOB #1 , then part (CELL #2) of VOB #3, and finally VOB #2 (CELL #3). 
[0135] Using the cell information of the playback path information (PGC) shown in Fig. 17, the controller 211 an also 

55 capture the type of cell reproduced, corresponding objects, and the playback start and end times of the objects. The 
controller 211 inputs the data for the period of the object specified by the cell information to the appropriate decoder. 
[0136] The controller 211 also identifies the objects to be reproduced based on the Object ID of the cell information. 
The controller 211 also identifies the a cell, which is the playback period of the identified object, by converting the 
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start_PIM and End_PTM of the cell information to a disc address value by referencing the access map of the corre- 
sponding VOB information. 

[0137] A player according to this embodiment of the invention also has a digital interface 204 for supplying the AV 
stream to an external device. It is therefore possible to supply the AV stream to an external device through an IEEE 
1394, IEC 958, or other communications means. This is so that, for example, when the player does not have an internal 
decoder for decoding a TS2_VOB not encoded by the recorder/player the TS2_VOB can be output directly without 
decoding through the digital interface 204 to an external STB for decoding and presentation via the STB. 
[0138] When the digital data is directly output to an external device, the controller 211 determines whether random 
access playback is possible based on the map information shown in Fig. 15B. If the access point data flag (random 
access presentation flag) is valid, the access map contains l-picture location information. In this case the controller 
21 1 is able to access and output digital data containing an l-picture to an external device through the digital interface 
in response to fast play and other requests from the external device. Furthermore, time-base access is also possible 
if the time access information flag is valid. In this case the controller 211 can access and output digital data including 
the picture data at a specified playback time to an external device through the digital interface in response to a time- 
's base access request from an external device. 

7. Basic operation of the recording function 

[0139] The configuration and operation of a DVD recorder according to the present invention for recording and re- 
producing an optical disc as described above is described next below with reference to Fig. 1 9. 
[0140] As shown in Fig. 19 the DVD recorder has a user interface 222 for receiving user requests and displaying 
information and prompts to the user, a system controller 212 handling the overall management and control of the DVD 
recorder, an analog broadcast tuner 21 3 for receiving VHF and UHF broadcasts, an encoder 21 4 for converting analog 
signals to digital signals and encoding the digital signals to an MPEG program stream, a digital broadcast tuner 215 
for receiving digital satellite broadcasts, an analyzer 21 6 for interpreting the MPEG transport stream sent from a digital 
satellite, a display unit 217 such as a television and speakers, and a decoder 218 for decoding the AV stream The 
decoder 218 has first and second decoders, for example, such as shown in Fig. 18. The DVD recorder also has a 
digital interface 21 9, track buffer 220 for temporarily storing write data, and a drive 221 for writing data to the disc. The 
digital interface 219 is an IEEE 1394 or other communications interface for outputting data to an external device. 
30 [0141] With a DVD recorder thus comprised the user interface 222 first receives a request from the user. The user 
interface 222 then passes the request to the system controller 212, and the system controller 212 interprets the user 
request and instructs the various modules to run appropriate processes. 

[0142] Recording includes self-encoding in which the DVD recorder encodes the input digital data, and outside en- 
coding for recording already encoded digital data to disc without further encoding. 

7.1 Recording operation by self-encoding 

[0143] Recording with self-encoding is described specifically first below using by way of example encoding and 
recording an analog broadcast to a PS_VOB stream. 

[0144] The system controller 21 2 sends a receive command to the analog broadcast tuner 21 3 and an encode com- 
mand to the encoder 214. 

[0145] The encoder 214 then video encodes, audio encodes, and system encodes the AV data from the analog 
broadcast tuner 213, and passes the encoded data to the track buffer 220. 

[0146] Immediately after encoding starts, the encoder 21 4 sends the time stamp information at the beginning of the 
MPEG program stream being encoded to the system controller 212 as the playback start time (PS_VOB_V_S_PTM), 
and parallel to the encoding process sends the data required to create the access map to the system controller 212 
This value is set as the Start_PTM of the cell information shown in Fig. 1 7 and generated later. The time stamp infor- 
mation is generally the PTS, but the SCR can be used instead. 

[0147] The system controller 212 then sends a record command to the drive 221 , and the drive 221 thus extracts 
and records data accumulated in the track buffer 220 to the DVD-RAM disc 100. A contiguous data area (CDA) as 
described above is also found in the recordable area of the disc and the data is recorded to the located contiguous 
data area. 

[0148] Recording typically ends when the user inputs a stop recording command. Stop recording commands from 
the user are input through the user interface 222 to the system controller 21 2, and the system controller 21 2 then sends 
a stop command to the analog broadcast tuner 213 and encoder 214. 

[0149] The encoder 214 stops encoding when it receives the stop encoding command from the system controller 
212, and sends the time stamp information of the last data in the last encoded MPEG program stream to the system 
controller 21 2 as the playback end time (PS_VOB_V_E_PTM). This value is set as the End.PTM of the cell information 
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shown in Fig. 17. The PTS is normally used for the time stamp information but the SCR can be used instead. 
[0150] After ending the encoding process the system controller 212 generates the presentation control information 
and VOB information (PS_VOBI) for the PS_VOB shown in Figs. 15A and 15B. 

[0151] The VOB information generated here includes map management information and an access map appropriate 
s to the object type. The system controller 21 2 sets the map validity information of the map management information to 
"valid," and sets the self-encoding flag ON. 

[0152] Original playback information (0_PGC information) as shown in Fig. 16A for the recorded object as one of 
the playback objects is generated as the presentation control information. This 0_PGC information is added to the 
original playback path table. The original playback path (0_PGC information) contains cell information. The cell infor- 
10 mation Type is set to PS_VOB. 

[0153] The system controller 212 then instructs the drive 221 to stop recording data accumulated in the track buffer 

220 [1910, sic] and to record the PS_VOB VOB information (PS_VOBI) and presentation control information. The drive 

221 thus records the remaining data in the track buffer220 and this information to the optical disc 1 00, and the recording 
process ends. 

15 [0154] It will be obvious that an analog broadcast could be encoded to TS1_VOB. In this case the encoder 214 must 
be an encoder for converting the analog signal to a digital signal and encoding the digital signal to the MPEG transport 
stream, and the cell information Type is set to TS1_VOB. 
[0155] The PTS or PCR can be used for the Start_PTM and End_PTM. 

20 7.2 Recording operation by outside encoding 

[0156] Recording with outside encoding is described specifically next below with reference to recording a digital 
broadcast. The recorded object type in this case is TS2_VOB. 

[0157] A digital broadcast recording request from the user is passed from the user interface 222 to the system con- 
25 troller 21 2. The system controller 21 2 then instructs the digital broadcast tuner 21 5 to receive and instructs the analyzer 
216 to interpret the received data. 

[0158] An MPEG transport stream sent from the digital broadcast tuner 215 is passed through the analyzer 216 to 
the track buffer 220. 

[0159] To generate the VOB information (TS2_VOBI) of the encoded MPEG transport stream (TS2_VOB) received 
30 as a digital broadcast, the analyzer 21 6 first extracts the time stamp information at the beginning of the transport stream 
as the start time information (TS2_VOB_V_S_PTM) and sends it to the system controller 212. This start time value is 
set as the Start_PTM of the cell information shown in Fig. 17 and generated later. The time stamp information is the 
PCR or PTS. The ATS indicating the timing at which the object is sent to the DVD recorder could alternatively be used. 
[0160] The analyzer 216 then analyzes the system layer of the MPEG transport stream to detect the information 
35 needed for access map generation. The l-picture locations in the object are detected based on the random access 
indicator (random_access_indicator) in the adaptation field of the TS packet header as described above. 
[0161] The system controller 212 then outputs a record command to the drive 221 , and the drive 221 thus extracts 
and records data accumulated in the track buffer 220 to the DVD-RAM disc 1 00. The system controller 2 1 2 also instructs 
the drive 221 where to record on the disc based on the allocation data of the file system. A contiguous data area (CDA) 
40 as described above is also found in the recordable area of the disc and the data is recorded to the located contiguous 
data area. 

[0162] Recording typically ends when the user inputs a stop recording command. Stop recording commands from 
the user are input through the user interface 222 to the system controller 21 2, and the system controller 21 2 then sends 
a stop command to the digital broadcast tuner 215 and analyzer 216. 

45 [0163] In response to the received stop command from the system controller 212, the analyzer 21 6 stops interpreting 
the received data and sends the time stamp information at the end of the last interpreted MPEG transport stream to 
the system controller 212 as the playback end time (TS2_VOB_V_E_PTM). This value is set as the End_PTM of the 
cell information shown in Fig. 17. The PCR or PTS is used for the time stamp information but the ATS indicating the 
time when the object was sent 1 to the DVD recorder can be used instead. 

50 [0164] After ending the digital broadcast reception process, the system controller 212 generates the presentation 
control information and VOB information (TS2_VOBI) for the TS2_VOB as shown in Figs. 1 5A and 1 5B based on the 
information received from the analyzer 216. 

[0165] The VOB information generated here includes map management information and an access map appropriate 
to the object type. The system controller 21 2 sets the map validity information of the map management information to 
55 "valid" if the l-picture locations in the objects were detected and the access map could be generated. The self-encoding 
flag is set OFF. If a valid access map could not be generated the map validity information is set to an "invalid" state. 
Examples of when a valid access map cannot be generated include when a corresponding digital broadcast is not 
received and when there is no random access data set in the adaptation field. If the signal is input directly through the 
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digital interface the signal may also not be an MPEG transport stream, and in this case, too, the map validity flag is 
set to "invalid." 

[0166] Original playback information (0_PGC information) as shown in Figs. 16A and 16B for the recorded object 
as one of the playback objects is generated as the presentation control information. This 0_PGC information is added 
5 to the original playback path table. The original playback path (0_PGC information) contains cell information. The cell 
information Type is set to TS2_VOB. 

[0167] The system controller 21 2 then instructs the drive 221 to stop recording data accumulated in the track buffer 

220 and to record the TS2_VOB VOB information (TS2_VOBI) and presentation control information. The drive 221 
thus records the remaining data in the track buffer 220 and this information to the optical disc 100, and the recording 

10 process ends. 

[0168] While the above recording operations are described with reference to user-input recording start and end 
commands, it will be obvious that the same essential operation applies to timer recordings controlled by a VCR, for 
example. In this case the system controller automatically issues the recording start and end commands instead of the 
user, and there is no essential change in DVD recorder operation. 

15 

8. Outline of the invention 

[0169] A data recording medium according to the present invention is a medium for recording data of various different 
formats, including analog broadcast or digital broadcast content and various types of data input through an analog/ 
20 digital interface. A data recording apparatus according to the present invention is an apparatus for recording AV data 
to and reproducing AV data from the same data recording medium. 

[0170] More particularly, externally input AV data is recorded as an MPEG_TS, and a stream adding decoder input 
time data for each MPEG_TS packet to each MPEG_TS packet is recorded to the data recording medium of the present 
invention. 

25 [0171] Recorder specific or content specific information and the locations of PSI (Program Specific Information) 
packets containing MPEG_TS control information are also embedded as a user private stream (UP packet), and the 
decoder input time of each packet is added in a format suitable for accumulation. 

[0172] Furthermore, so simplify conversion to an MPEG_PS when multiplexing the MPEG_TS, data less than one 
pack (2048 bytes) is system encoded as one continuous multiplexing unit, and an M PEG_TS is recorded while allocating 
30 each continuous multiplexing unit to one or plural MPEG_TS packets. 

9. Detailed embodiments of the invention 
First Embodiment 

35 

[0173] The basic recording and playback operations of a data recording and reproducing apparatus according to the 
present invention are substantially as described above, and only the basic operation for recording analog line input is 
therefore described specifically below with reference to Fig. 20. The recorded object type in this case is TS1_VOB. 
[01 74] Analog line input recording requests from a user are passed from the user interface 222 to the system controller 
40 212. The system controller 21 2 then sends a receive command to the line input unit 223 and a data encoding command 
to the encoder 214. 

[0175] The MPEG transport stream from the encoder 214 is sent to the track buffer 220. 

[0176] To generate the VOB information (TS1_VOBI) of the encoded MPEG transport stream (TS1_VOB), the en- 
coder 214 first sets the time stamp information as the presentation start time (TS1_VOB_V_S_PTM) and sends it to 
45 the system controller 212. This start time value is set as the Start_PTM of the cell information generated later and 
shown in Fig. 1 7. The time stamp information is the PCR or PTS. 

[0177] The encoder 214 also generates the data needed for access map generation while generating the MPEG 
transport stream. This is done by, for example, storing the adaptation field in the first MPEG transport packet of the I- 
picture, setting the random_access_indicator bit, and notifying the system controller 212 of the start of a VOBU. 
50 [0178] The system controller 212 then sends a record command to the drive 221 , and the drive 221 extracts and 
records data from the track buffer 220 to the DVD-RAM disc 100. The system controller 212 also instructs the drive 

221 where to record on the disc based on the allocation data of the file system. A contiguous data area (CDA) as 
described above is also found in the recordable area of the disc and the data is recorded to the located contiguous 
data area. 

55 [0179] Recording typically ends when the user inputs a stop recording command. Stop recording commands from 
the user are input through the user interface 222 to the system controller 21 2, and the system controller 21 2 then sends 
a stop command to the encoder 214. 

[01 80] I n responr : to the received stop command from the system controller 21 2, the encoder 2 1 4 stops the encoding 
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process and sends the time stamp information included in data at the end of the last encoded MPEG transport stream 
to the system controller 212 as the end presentation time (TS1_VOB_V_E_PTM). This value is set as the End_PTM 
of the cell information shown in Fig. 17. The time stamp information becomes PCR or PTS. 

[0181] After ending the recording process, the system controller 212 generates the playback control information and 
5 VOB information (TS1_VOBI) for the TS1_VOB as shown in Figs. 15A and 15B based on the information received 
from the encoder 214. 

[01 82] The VOB information generated here includes an access map and map management information those adapt- 
ed to the object type. The system controller 21 2 sets the map validity information of the map management information 
to "valid". The self-encoding flag is set ON. 
10 [0183] Original playback path information (0_PGC information) as shown in Figs. 16A and 16B for the recorded 
object as one of the playback objects is generated as the presentation control information. This 0_PGC information 
is added to the original playback path table. The original playback path information (0_PGC information) contains cell 
information. Type information of the cell information is set to 'TS1_VOB". 

[0184] The system controller 212 then instructs the drive 221 to stop recording data accumulated in the track buffer 
15 220 and to record the VOB information (TS1_VOBl) and playback control information for TS1_VOB. The drive 221 
thus records the remaining data in the track buffer 220 and this information to the optical disc 100, and the recording 
process ends. 

[0185] The self-encoding MPEG transport stream generated by the encoder 214 is described in further detail below. 
[0186] The structure of the self-encoding MPEG transport stream is shown in Figs. 21 A and 21 B. As shown in the 
20 figure the self-encoding MPEG transport stream is segmented into VOBU units. Each VOBU starts with a PAT packet, 
PMT packet, and a User Private packet (UP packet) embedded with stream-specific data. A PAT packet and PMT 
packet at least are also located at the beginning of the VOB. 

[0187] As shown in Fig. 21 B an ATS indicating the decoder input time is also added to each packet, and each packet 
is input to the decoder at the time intended by the ATS. 
25 [0188] The self-encoding program information (such as the PMT packet PID) is stored to the PAT packet of the first 
packet and input to the decoder at the time indicated by ATS1 . 

[01 89] The PID for each elementary stream composing the program is stored to the PMT packet of the second packet. 
In this example PIDs for the video, audio, data broadcast ("Data" in the figure), and user private ("private" in the figure) 
packets are stored. 

30 [0190] Information added to the stream is stored to the user private packet in the third packet. This added information 
could, for example, include: stream title information; recording date and time information; stream attributes, that is, 
stream encoding information such as the bit rate, video resolution, frame rate, aspect ratio, or encoding method; input 
source identification information for identifying whether the . line input is analog or digital; information indicating the 
AV data encoding method if the data is digital; copyright protection information indicating whether copying is allowed 

35 or prohibited; Vertical Blanking Interval (VBI) signals such as closed caption (CC) data, teletext data, or Wide_Screen 
Signaling (WSS) data used for display control; information indicating system encoding conditions; DVD standard com- 
patibility information; menu information provided for user convenience using specific data provided by the manufacturer 
that recorded the stream; and data useful for conversion to various DVD standard MPEG program streams (M PEG_PS). 
[0191] The decoder input time for a packet stored in this added information and located in the MPEG transport stream 

to as above is described next with reference to Figs. 22A and 22B. 

[0192] Fig. 22A is a block diagram showing the basic configuration of a decoder referred to as a transport stream 
system target decoder (T_STD). This figure further shows a system decoder 235 for interpreting a PS I packet and 
providing decoder control (not described above). 

[0193] When a PAT (PSI packet), PMT, or CAT (Conditional Access Table) packet, as PSI packet, is input to the 
45 T_STD, the packet is discriminated according to packet type by demultiplexer 232, and the PSI packet which is used 
for system control is sent immediately to a transport buffer 233. 

[0194] Data accumulated in the transport buffer 233 is then streamed to the system buffer 234 at a rate of 1 ,000,000 
bits/second (=Rsys). 

[0195] The PSI data becomes valid the moment the required PSI data is accumulated in the system buffer 234. 
50 [0196] This T_STD model in MPEG thus defines an operating model for the decoder and defines standards for the 
MPEG transport stream transfer rate, for example. 

[0197] There are several restrictions on PSI packet transfer because the data recording apparatus must self-encode 
the transport stream according to an MPEG transport stream format that assures the T_STD can correctly decode the 
transport stream. A method of determining the ATS that determines the packet transfer rate is described next with 
55 reference to Fig. 22B. 

[0198] When reproducing a self-encoding stream the leading PAT, PMT, and UP packets are input to the T_STD at 
the time indicated by ATS1 , ATS2, and ATS3, respectively. 

[0199] The PMT packet and UP packet are now considered, in order to interpret, by the T_STD, the PID of the UP 



15 



BNSDCCID: <EP 1422710A1 J_> 



EP1 422 710 A1 

packet specified by the PMT packet and valid it, the last byte (byte m) of the TS_program_map_section must be stored 
in the system buffer 234. 

[0200] That is, for the PMT to be valid (m+n+5) x 8 /Rsys seconds must have passed from ATS2 as the PMT packet 
input time. Note that n is the byte length of the PMT packet adaptation_field. 
5 [0201] Because the System Clock Frequency (SCF) as the T_STD reference clock is 27,000,000 Hz (with a defined 
tolerance range of ±810 Hz for error), the following relationship between ATS3 and ATS 2 must be true if the ATS is a 
time expressed to the precision of the System Clock Frequency. 

10 ATS3 > ATS2 + ((m+n+5)*8/Rsys)*SCF 

[0202] Because the shortest interval between ATS2 and ATS3 is only when there is no adaptation_field (n = 0) in 
the PMT packet and the smallest TS_program_map_section (21 bytes) is stored in the PMT packet, a time interval of 
208/Rsys x SCF is shortest. 

15 [0203] The following relationship is likewise required for the input time ATS1 of the PAT packet and input time ATS2 
of the PMT packet 

ATS2 > ATS1 + ((m0+n0+5)*8/Rsys)*SCF 

20 

where mO is the byte length of the Program association section in the PAT packet, and nO is the byte length of the 
adaptation_field in the PAT packet. 

[0204] Furthermore, because the shortest interval between ATS1 and ATS2 is only when there is no adaptation_field 
(n = 0) in the PAT packet and the smallest Program association section (1 6 bytes) is stored to the PAT packet, a time 
25 interval of 1 68/Rsys x SCF is shortest. 

[0205] If time is expressed with a precision of 27 MHz using a System Clock Frequency (SCF) of 27 MHz, the shortest 
time interval between ATS1 and ATS2 and between ATS2 and ATS3 is 4536 and 561 6, respectively. 
[0206] Storing the User Private packet to the self-encoding transport stream is described next with reference to Figs 
23 to 26. 

30 [0207] Fig. 23 shows storing the UP packet when the UP packet is defined as a User Private stream. In this case an 
identification number greater than or equal to "0x80" and less than or equal to "OxFF" is allocated to stream_type of 
the PMT corresponding to the UP packet. A unique PID is assigned to the UP packet. The internal data structure of 
the UP packet does not conform to the MPEG standard. Note that in this example the UP packet includes a section 
structure called the DVD_attribute_section(). 

35 [0208] Fig. 24 shows a further storage method whereby a private_section structure is included in the UP packet and 
a unique PID is assigned. The data structure of the private_section will vary somewhat according to the value of the 
section_syntax_indicator in the private_section, but data specific to the UP packet is stored in the private_data_byte 
of the private_section. In this case, identification number of 0x00 is assigned to stream_type. 

[0209] Fig. 25 shows a method of storing a UP packet as a packet with the same. PID as the PMT packet. In this 
40 case the UP packet data structure conforms to the private_section structure. The stream type is not defined, and PID 
of PMT packet is assigned to UP packet. 

[0210] Fig. 26 shows an example in which the UP packet is not stored separately but is enclosed in the PMT packet. 
In this case the specific data equivalent to the UP packet has a private_section structure, and the private_section is 
written after the TS_program_map_section . That is, PMT packet includes both TS_program_map_section and 
45 pri vate_sectio n . 

[0211] The specific data stored to the MPEG_TS by the above-noted methods is described next. 
[0212] As shown in Figs. 23 to 26, this specific data includes the Real-time Data Information General Information 
(RDI_GI) of the RDI Unit and the Display Control Information and Copy Control Information (DCI_CCI) of the DVD 
Video Recording standard. 

50 [0213] The RDI_GI stores the first presentation start time (VOBU_S_PMT) of the VOBU and the recording date and 
time information. The DCI_CCI stores, for example, the VOBU aspect ratio information, subtitle mode information, film 
or camera mode information and other information related to display control, copy generation management information, 
APS information, and input source information. (For further information about RDI_GI and DCI_CCI, see the DVD 
Video Recording standard.) 

55 [0214] The V_ATR field stores the video bit rate, resolution, frame rate (or video_format such as NTSC or PAL), 
aspect ratio, and encoding method (an MPEG2_Video or MPEG1_Video identifier). 

[0215] Likewise, the A_ATR field stores the bit rate for all or part of the audio, encoding method, channel count, 
quantization bits, and dynamic range control information according to the number of audio streams. 
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[0216] The CC field stores the closed caption data for the VOBU. To improve the transferability of PS conversion, 
closed caption data can be written in an extension_and_user_data (1) format (a method of storing user data to the 
GOP layer), or the closed caption data could be written separately. 

[0217] Storing the closed caption data to the user data of the GOP layer improves MPEG_PS conversion efficiency 
s because the DVD Video and DVD Video Recording standards are defined for this purpose. 

[021 8] The C_SE field stores information relating to some problems associated with TS2PS conversion of the VOBU 
or VOB. 

[0219] Regarding the CC, WSS, or teletext data storage location information, that information indicates whether, for 
example, closed caption data is contained in the UP packet, whether closed caption data is written as user data to the 
10 picture headers, or whether there is no closed caption data in the particular VOBU (or VOB). 

[0220] Regarding the WSS storage location information, that information further indicates whether it is stored as 
specific data in the UP packet, or whether it is written to the user data in the picture headers. 

[0221] Regarding the teletext storage location information, it indicates whether a TS packet is provided for storing 
the teletext data, or whether it is written to the user data in the picture headers. 

15 [0222] Regarding the multiplexed block structure and transfer information, that information includes information in- 
dicating if the number of TS packets in the multiplex block (a data block in which only one elementary stream is stored 
without being mixed with another elementary stream) as shown in Figs 27A to 27H is fixed or variable, the number of 
packets if the number is fixed, information indicating whether a PTS/DTS is added to the first TS packet in the multiplex 
block, or the transfer rate within the same multiplex block. During MPEG_TS encoding imposing no conditions on 

20 conventional multiplexing, the multiplex block can be written with a fixed length including only one TS packet. 

[0223] The decoder buffer control information includes vbv_delay, a parameter of the video verifying buffer, informa- 
tion such as vbv_buffer_size indicating the remaining video buffer capacity (this information is used to determine how 
far ahead of the ATS input time the video data can be read), and the time difference between the decoding time and 
the input completion time of the VOBU frame for which the buffer input time is closest to the frame decoding time (this 

25 information is used to determine how far back from the ATS input time the video or audio data can be read). 

[0224] The DVD_Compatibility information indicates the overhead involved with system transcoding a MPEG_TS to 
a MPEG_PS conforming to a DVD standard. 

[0225] The DVD_Compatibility information indicates how easy it is to convert a MPEG_TS to other DVD formats. 

For example, if the multiplex blocks are 2 KB or less, a level 1 indicator is set; if there is closed caption, WSS, or teletext 
30 data, the closed caption or WSS data is stored to an UP packet, and the teletext data is stored as a teletext packet in 

a multiplex block storing video data, a level 2 indicator is set; if it is not necessary to consider buffer management when 

the closed caption, WSS, or teletext data is stored to the area specified by the DVD standard, a level 3 indicator is set; 

and if it is not necessary to consider buffer management when the ATS of the first TS packet in the multiplex block is 

replaced by the SCR, a level 4 indicator is set. 
35 [0226] This DVD_Compatibility information is thus a data set indicating the ease of convertibility to various DVD 

formats, including DVD Video, DVD Audio, DVD Video Recording, and DVD Stream Recording. 

[0227] Figs. 27A to 27H show the structure of an MPEG_TS using multiplex blocks, and the data structure when this 
MPEG_TS is converted to DVD Video and DVD Video Recording formats. 

[0228] The self-encoded TS stream shown in Fig. 27A comprises the VOBU (playback and decoding units) of the 
40 self-encoded TS stream shown in Fig. 27B. As shown in Fig. 27C one VOBU includes multiple multiplex blocks (cor- 
responding to MPEG_PS packs). Each multiplex block can be segmented into fixed length data units as shown in Fig. 
27D (enabling easy packaging in the device) or into variable length data units as shown in Fig. 27E (thereby consuming 
less disc space). In the cases shown in Figs. 27D and 27E the multiplex blocks are respectively formed by segmenting 
non-elementary steams such as PSI/SI packets or UP packets and the elementary stream, but as shown in Fig. 27F 
45 a multiplex block could store both an elementary stream and non-elementary stream objects such as PSI/SI packets 
or UP packets. Note that in Fig. 27F multiplex block #1 and multiplex block #2 are one multiplex block. 
[0229] The above streams can be easily converted to the DVD Video format shown in Fig. 27G or the DVD Video 
Recording format shown in Fig. 27H. 

[0230] In this case it is important for simple TS2PS conversion that the MPEG_PS packs are formed in the multiplex 

50 block sequence and one multiplex block is the unit storing one pack of data. 

[0231] It should be noted that the capsule header and ATS are only loosely related to the present invention and are 
therefore omitted in Figs. 27A to 27H. In addition, the packs in the converted MPEG_PS shown in Figs. 27G and 27H 
are also stuffed or padded as appropriate according to the byte length and VOBU alignment of the stored elementary. 
[0232] Figs. 28A to 28G describes the multiplexing method of the present invention, comparing with the conventional 

55 stream multiplexing method shown in Fig. 8. As shown in the figure the final format conforms to the MPEG_TS format 
shown in Fig. 28G. The video stream (Fig. 28A) comprises plural GOP (Fig. 28B). Each GOP contains specific picture 
data, and a TS packet group of a data size equivalent to the data size of one pack when converted to an MPEG_PS 
is one multiplex block (Fig. 28C). That is, one multiplex block is segmented into plural TS packets equivalent to the 
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data size of one pack as shown in Fig. 28D. The audio stream is likewise packed in one multiplex block group having 
a plurality of TS packets. As sown in Fig. 28E, a VOBU is formed by multiplexing by multiplex block unit. The greatest 
difference between the present invention and the prior art shown in Fig. 8 is in that data units of a size equivalent to 
the data size of one MPEG_PS pack are grouped to form the multiplex blocks (see Fig. 28E). 
5 [0233] Furthermore, the ATS may be added to each MPEG_TS packet while increased by a specific amount (AATS) 
in each packet within the same multiplex block as shown in Fig. 29. This is effective to avoid complex buffer management 
during TS2PS conversion, and convert ATS to SCR using a simple offset or no offset. ATSi (i = 0, 1 , 2...) in this case 
satisfies the following equation. 

10 

ATSi + (packet count in the multiplex block) x AATS < ATSi+1 

[0234] When the multiplex block is a fixed length, the number of TS packets in one multiplex block is fixed and thus 
the multiplex block boundaries are easily known. However, when the multiplex block is variable length, the number of 

15 TS packets in one multiplex block is also variable and thus the multiplex block boundaries are not easily known . There- 
fore, the increase (AATS) in the ATS at the multiplex block boundary is set to a specific value different from the (constant) 
increase within the multiplex block. That is, the difference (AATS) between the ATS of the last packet in the previous 
multiplex block and the ATS of the first packet in the immediately following multiplex block is set to a specific value 
which is not the constant value. This makes it possible to know the multiplex block boundaries by monitoring AATS. A 

20 1:1 correlation between packs and TS packets when converting to an MPEG_PS can therefore be assured. ATSi in 
this case satisfies the following equation. 

ATSi + (packet count in the multiplex block) x AATS < ATSi+1 

25 

[0235] Furthermore, the ATSi added to the first packet in the MPEG_TS multiplex block corresponds to SCRi added 
to each pack in the MPEG_PS after conversion. 

[0236] Furthermore, as also shown in Fig. 29, closed caption, DSI, or other text information can also be stored in 
the UP packet. The DSI in the UP packet is used to generate NV_PCK data after conversion, and the closed caption 
30 data is stored to the video pack. To enable compatibility with the PAL standard used in Europe, packets storing teletext 
data in the multiplex block can be inserted between the video data packets as shown in Fig. 30. In this case the teletext 
data packets are located immediately before the simultaneously presented picture having the same PTS. After con- 
version the teletext data Is stored to the video pack. 

[0237] Fig. 31 shows the data structure of a UP packet storing the DSI as described above. 

35 [0238] Information (such as a relative number from the beginning of the VOBU) identifying the TS packet storing the 
last byte of the first l-picture in the VOBU can also be described in the added information of the UP packet to enable 
efficient special playback modes. Special playback modes can also be supported by also describing picture encoding 
type information of some of I- and P-pictures or all pictures in the VOBU, the data size of each picture (such as infor- 
mation identifying the TS packet containing the last byte), and information indicative of the DTS/PTS for each picture. 

40 [0239] It should be noted that if encoding is done so that TS packet containing the PTS/DTS is located at the begin ning 
of the multiplex block in the present embodiment, the beginning of an access unit will be located at the beginning of 
the packs after TS2PS conversion, and simplified DVD-specific header processing can be expected. 
[0240] To prevent an overflow of data stored to MPEG_PS packs and ease conversion to an MPEG_PS, the TS 
packets of the multiplex blocks can be appropriately stuffed or a necessary number of stuffing bytes can be inserted 

45 after the last TS packet in the multiplex block. 

[0241] The present embodiment has been described primarily with reference to recording to DVD, but the invention 
will obviously not be so limited. More specifically, after recording a self-encoded transport stream to a hard disk, sem- 
iconductor memory, or other data recording medium, a stream converted to an MPEG program stream can be recorded 
to the same medium or to a different medium. 

so [0242] Furthermore, the PAT, PMT, and UP packets are described as recorded to the beginning of each VOBU in 
the present embodiment, but they can be recorded to the beginning of at least a VOB or to the beginning of a Cell 
which is the playback management unit. 

[0243] Yet further, this embodiment is described recording PAT, PMT, and UP packets, but the UP packet can be 
omitted. 

55 [0244] Yet further, the PAT, PMT, and UP packets are described as fixed at the beginning in the present embodiment, 
but the invention shall not be so limited, and a packet storing a Null packet can be recorded inserted therebetween. 
[0245] Yet further, a self-encoded stream is described starting from a PAT packet, but the invention shall not be so 
limited and the stream could start from a Null packet. 
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[0246] Furthermore, the system transfer rate can be 'set to a fixed rate by appropriately inserting Null packets in the 
self-encoded stream. 

[0247] It should also be noted that a data area for storing manufacturer-private information can be provided as shown 
in Fig. 7, and MPEG_TS system encoding conditions can be written to this data area. 
5 [0248] It should also be noted that all or part of the information written to the UP packet in the above embodiment 
can be written to the TS1_VOB information shown in Fig. 15. 

[0249] It will also be noted that the DVD Video formal does not allow for dual mono audio. It is, however, possible 
to convert a self-encoding transport stream recorded with dual mono audio channels to the DVD Video format by 
separating the dual mono audio channels into two separate audio streams recorded as left and right monaural audio 
10 channels. 

[0250] Part or all of the parameters written to the UP packet in the above embodiment could also be written into the 
management information. By thus avoiding recording a parameterthat does not change within a self-encoding transport 
stream multiple times, recording space is not wasted and the decoder does not need to waste processing time trying 
to determine whether or not the parameter changed each time a UP packet is detected. 

15 

Second Embodiment 

< Encoder configuration > 

20 [0251] An alternative embodiment of the present invention is described next below. The description is made to an 
encoder of a data recording apparatus according to the present invention by focusing first the encoding process to 
receive and self-encode AV input to an MPEG transport stream. 

[0252] Fig. 33 shows the configuration of the encoder in a data recording apparatus according to the present inven- 
tion. As shown in the figure the encoder 21 4 includes elementary stream encoders 230a, 230b and 230c, and a system 
25 encoder 232. The encoder 214 receives a control signal from the system controller 212 and then runs the encoding 
process with the elementary stream encoders 230a, 230b and 230c, or the system encoder 232 while switching between 
elementary encoding and system encoding. Each of the elementary stream encoders 230a, 230b and 230c receives 
video, audio, and VBI (Vertical Blanking Interval) signals for encoding. 

[0253] The video encoder 230a receives a control signal from the system controller 21 2 and based thereon encodes 
30 the bit rate, resolution, aspect ratio, and other attributes of the video stream within a predefined range. More specifically, 
the video encoder 230a receives a control signal from the system controller 212 specifying the operating mode as the 
"DVD Video compatible mode," DVD Video Recording compatible mode," or "normal mode." If the mode specified by 
the control signal is the DVD Video compatible mode, the video encoder 230a generates a video stream conforming 
to the video attributes of the DVD Video standard; if the DVD Video Recording compatibility mode, it generates a video 
35 stream conforming to the video attributes of the DVD Video Recording ("DVD VR" below) standard; and if the normal 
mode, generates a video stream conforming to a specific attribute range. 

[0254] The audio encoder 230b likewise receives a control signal from the system controller 21 2 and based thereon 
encodes the bit rate, quantization rate, channel count, and other attributes of the audio stream within a predefined 
range. Like the video encoder 230a, the audio encoder 230b specifically receives a control signal from the system 
40 controller 21 2 specifying the operating mode. If the mode specified by the control signal is the DVD Video compatibility 
mode, the audio encoder 230b generates an audio stream conforming to the audio attributes of the DVD Video standard; 
if the DVD VR compatibility mode, it generates an audio stream conforming to the audio attributes of the DVD Video 
Recording ("DVD VR" below) standard; and if the normal mode, generates an audio stream conforming to a specific 
attribute range. 

45 [0255] The VBI data encoder 230c likewise receives a control signal specifying the operating mode from the system 
controller 212 and encodes the VBI data accordingly. Specifically, if the elementary stream encoding control signal 
input from the system controller 212 to the VBI data encoder 230c indicates the DVD Video compatible mode or DVD 
VR compatible mode, it additionally encodes VBI data according to the VBI data storage method specified by the 
respective standards. There is a case that a VBI data storage method is separately defined even in the original normal 

50 mode, and in that case "additionally encode" means that VBI data is redundantly stored to the elementary stream. 
[0256] The encoded elementary streams are then multiplexed to the MPEG_TS system stream by the system en- 
coder 232. 

[0257] Like the elementary stream encoders 230a, 230b and 230c, the system encoder 232 also receives an encoding 
control signal from the system controller 212 to encode according to the received signal. 
55 [0258] The control signal from the system controller 212 to the system stream encoder 232 is either a system en- 
coding control signal for encoding a normal MPEG_TS, or a system encoding control signal applying constraints on 
the normal MPEG_TS in order to enable easy conversion to an MPEG_PS (particularly a specific DVD format). 
[0259] If the control signal is for encoding a normal MPEG_TS, the system stream encoder 232 applies the system 
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encoding to the elementary streams input from the elementary stream encoders 230a, 230b and 230c while managing 
the buffers so that the input streams are not corrupted by the decoder model (T_STD) which is a reference for the 
MPEG_TS system stream. 

[0260] If the control signal from the system controller 212 is a control signal specifying system encoding to an 
M PEG_TS enabling easy conversion to an MPEG_PS, the encoding is conducted while also following additional special 
system encoding rules. 

[0261] The encoder 214 then outputs the resulting self-encoding MPEG_TS system stream. 

[0262] The data recording apparatus according to the present invention is thus characterized by switching the en- 
coding mode at the elementary stream and system stream encoding levels. The processes applied in each encoding 
mode to convert to a particular DVD format when the encoding mode is changed as described above are shown in the 
table in Fig. 34. 

[0263] An M PEG_TS enabling easy conversion to an MPEG_PS is thus generated by driving the elementary stream 
encoders 230a, 230b and 230c and system encoder 232 to encode the respective streams assuming the conversion 
toanMPEG_PS. 

< A self-encoded MPEG_TS > 



[0264] A detailed embodiment of the format of an MPEG_TS self-encoded by a data recording apparatus according 
to the present invention is described next below. The differences between a normal MPEG_TS ("SESF" below) and 
an MPEG_TS enabling easy conversion to an MPEG_PS (a "Constrained SESF" below) are also described. 
[0265] In the following example, information presenting the stream encoding conditions is stored to the VOBI storing 
attributes and other information in MPEG_TS stream units. By thus storing information about the encoding conditions 
to the management information and not in the stream, it is possible to quickly determine whether a stream can be easily 
converted to a DVD Video or DVD VR format without analyzing the stream. Note that this information presenting the 
25 stream encoding conditions can be stored to a Tip packet which is described further below. 

[0266] The information presenting the stream encoding conditions is represented by an "encode_condition" flag 
which has two bits. The flag value is described below. 



00b: normal MPEG_TS (SESF) 

01b: MPEG_TS enabling easy conversion to a DVD VR stream format (Constrained SESF) 
10b: reserved 

11b: MPEG_TS enabling easy conversion to a DVD Video stream format (Constrained SESF) 

[0267] Two cases are possible if the encode_condition flag is set to 00b in the stream management information: the 
stream is originally encoded without considering high speed conversion to MPEG_PS, and a sequence of MPEG pro- 
gram streams are linked by user editing for easy conversion to individual MPEG program streams. 
[0268] If the encode__condition flag is also set in the stream, it is meaningless to set encode_condition = 00b indicating 
a normal MPEG_TS in the stream. It is therefore also possible for the encode_condition flag to be used differently 
inside and outside the stream, reserving the encode_condition = 00b setting so that it is not used in the stream (in the 
40 Tip packet described below). 

[0269] By thus setting this flag, it is possible to determine from the value of the VOBI encode_condition field whether 
the stream can be easily converted to a DVD Video or DVD VR format. "Easily converted" as used herein means 
convertible by the conversion method described further below. 

45 < Constrained SESF stream structure > 

[0270] Fig. 80 shows the complete stream structure of a Constrained SESF A Constrained SESF includes plural 
SESF capsules 200. An SESF capsule 200 contains specific multiplexing units 21 0, and a Tip packet (detailed below) 
at the head. The presentation time stamp (PTS) of each SESF capsule 200 and an address of the Tip packet are 
correlated in the access map 80c. As described below, for TS2PS conversion, a conversion process is accomplished 
in SESF capsule units. 

[0271] Fig. 32 shows the correlation between MPEG_PS packs and packets in on SESF capsule. As shown in Fig. 
32 a TS packet (Tip packet below) storing stream-specific information is inserted to a Constrained SESF. The Tip 
packet embedded in a Constrained SESF is described below with reference to Fig. 35 to Fig. 41 . 



< Tip packet > 

[0272] Fig. 35 shows the overall Tip packet structure. As shown in Fig. 35, a Tip packet stores a DataJD identifying 
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the packet as a Tip packet, display_and_copy_info corresponding to the DVD VR DCl_CCI field and including display 
control and copy control information, encode_info storing stream encoding information, and Makers Private Data for 
storing additional information unique to the manufacturer. 

[0273] As shown in Fig. 35 and Fig. 36, the PCR value needed for the SCR calculations described below is written 
5 to the adaptation field of the Tip packet. This adaptation field is a fixed byte length, and thereby enables accessing 
information in the Tip packet using a fixed address. 

[0274] The DataJD structure is shown in Fig. 37. The DataJD has a Datajdentifier for identifying whether the 
corresponding packet is a Tip packet. The Datajdentifier is a 3-byte field holding a value of ,, 0x544950 B expressing 
Tl P" in the ASCII code. The decoder of the playback device reads the value of this field to identify that it is a Tip packet. 
10 [0275] The display_and_copy_info structure is shown in Fig. 38. Generating the RDI pack when converting a Con- 
strained SESF to the DVD VR format is simplified by providing the same structure and information as the DCI_CCI 
field of the RDI Unit in the DVD VR standard in display_and_copy_info. (Note that the DCI_CCI field of the DVD VR 
standard is fully described in "DVD Specifications for Rewritable/Re-recordable Disc, Part 3, VIDEO RECORDING," 
and in Japanese patent No. 3162044. While some of the field names are different in these documents, the field defi- 
es nitions are the same so as to enable direct copying when converting to the DVD VR format.) 

[0276] The encode_info field structure is shown in Fig. 39. Resolution information for the video stream following the 
Tip packet is written to the video_resolution field. The value of encode_info is shown below. 

0000b: 720 x 480 (NTSC), 720 X 576 (PAL) 
20 0001b: 704 x 480 (NTSC), 704 x 576 (PAL) 

0010b: 352 x 480 (NTSC), 352 x 576 (PAL) 

0011 b: 352 x 240 (NTSC), 352 x 288 (PAL) 

0100b: 544 x 480 (NTSC), 544 x 576 (PAL) 

0101b: 480 X 480 (NTSC), 480 x 576 (PAL) 
25 Others: reserved 

[0277] Resolution can vary during a single continuous recording in the DVD VR format. However, streams of different 
resolutions are managed as separate VOBs and it assures seamless connection during playback by a certain recorder. 
This field is therefore used to determine where it is necessary to split the VOB when converting to the DVD VR format, 
30 if there is a resolution change during Constrained SESF recording. 

[0278] In a Constrained SESF recorded with consideration for converting to the DVD Video format (encode_condition 
= 11b), the resolution does not change within a single stream. 

[0279] The encode_condition field is the same as the value stored to the VOBI (except when 00b). The reason why 
the encode_condition field is stored and embedded in the stream and not only in the stream management information 
35 is to enable the recorder to easily determine if it is possible to convert to the DVD format by referencing the 
encode_condition field in the Tip packet when, for example, a stream is copied through a digital interface such as IEEE 
1394. 

[0280] VOBU_S_PTM of the DVD VR standard is recorded to the FVFPST field. This is to eliminate the process of 
analyzing the video stream encoded after the Tip packet and calculating the presentation time of the first appearing 
40 video field when converting a Constrained SESF to a DVD Video or VR format. 

[0281] The FVFPST field includes a 32-bit field denoting the video field presentation time at 90 KHz precision, and 
a 16-bit field denoted at 27 MHz precision. 

[0282] The PESJnfo structure is shown in Fig. 40. PESJnfo is needed to convert a Constrained SESF to the DVD 
Video format without analyzing the elementary streams. This information is needed to generate the information inserted 
45 to the DVD Video stream and stored in the packs, referred to as NV_PCK, supporting special playback modes. 

[0283] The PESJnfo can store information for 136 PES packets each storing video data or audio data units. Four 
bits are assigned to each PES packet, and the NV_PCK information can be generated without analyzing PES packet 
content. PES packets not storing video or audio data are ignored. 

[0284] In a SESF capsule being the data unit from one Tip packet to the packet immediately preceding the next Tip 
50 packet, a PES_existence_flag declares if the j-th PES packet is present in the SESF capsule. The vaule of 
PES_existence_flag is set as follows. 

0b: j-th PES packet is not in the SESF capsule 
1b: j-th PES packet is in the SESF capsule 

55 

[0285] If the PES_extension Jlag = 0b (when there is no PES packet), all remaining fields in the PES packet are set 
to 0b. 

[0286] The PES_payload_identifier identifies whether the data stored in the PES packet is video or audio data. 
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PES_payload_identifier values are set as follows. 



Ob: 
1b: 



video stream 
audio stream 



5 



[0287] The PES_existence_flag and PES_payload„identifier fields are set for all relevant PES packets. 

[0288] When it is determined from the PES_payload identifier whether video or audio data is stored, the remaining 

field definitions vary according to the type of stream stored in the PES packet. 

[0289] If the PES packet stores a video stream (PES_payload_identifier = Ob), picture_coding_type indicating the 
type of picture stored in the PES packet is defined after the PES_payload_identifier field. 
[0290] The value of the picture_coding_type field is set as follows. 

00b: a picture encoded with encoding other than 01b or 10b 

01b: a frame encoded 1 -picture; a pair of field encoded l-pictures; or a pair of field encoded l-picture and field 
encoded P-picture 

1 0b: a pair of frame encoded P-pictures or a pair of field encoded P-pictures 
11b: reserved 

[0291] In other words, a picture with 01b or 10b is a picture used as the reference picture defined by the DVD Video 
standard. The above description is for information added to PES packets storing video. 

[0292] If the PES packet stores an audio stream (PES_payload_identifier = 1b), the PES_payload_identifier is fol- 
lowed by a strearnjdentifier and a sync_presentation_flag. The strearnjdentifier identifies whether the audio stream 
in the PES packet is a first audio stream or a second audio stream. The sync_presentation_flag is a flag to identify 
whether there is an audio frame for which presentation begins simultaneously to or immediately following the FVFPST 
field (the presentation start time of the video field presented first) written to each Tip packet. 
[0293] The value of strearnjdentifier is set as follows. 

0b: first audio stream 
1b: second audio stream 



[0294] The first and second audio stream can be discriminated by the PID setting rules and the order of elementary 
stream declaration in the EMT 

[0295] The value of sync_presentation_flag is set as follows. 

0b: an audio frame for which presentation begins simultaneously to or immediately following the FVFPST is not 
stored in the audio PES packet 

1b: an audio frame for which presentation begins simultaneously to or immediately following the FVFPST is stored 
in the audio PES packet 

[0296] Information added to PES packets storing audio is as described above. 

[0297] The PESJnfo field thus extracts and stores information for each PES packet following a Tip packet. 
[0298] Fig. 41 shows the MakersPrivateData. As shown in the figure : the MakersPrivateData has a makerJD field 
identifying the manufacturer of the Constrained SESF, and maker_private_data field containing specific additional in- 
formation described by the manufacturer. 

[0299] Figs. 42A and 42B shows an example of a value of PID of the Tip packet and a value of stream Jype indicating 
the stream type. Other PID and stream_type values are reserved by the MPEG standard and other standards, and 
these values were selected to indicate private data beyond the scope of the MPEG standard without interfering with 
reserved values. 

[0300] Various stream attribute information is thus extracted and stored to the Tip packets stored in a Constrained 
SESF. How the fields described above are used during conversion to a different DVD format is described in further 
detail below. 

< System encoding conditions > 

[0301] The system encoding conditions for a Constrained SESF are described in detail next below. Note that the 
following system encoding conditions do not apply to a normal SESF. 
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< Multiplexing Unit > 

[0302] TS packet storing elementary streams in a Constrained SESF is composed of a multiplexing unit which is a 
unit of data stored in 2 KB packs according to a DVD format. Note that this multiplexing unit corresponds to the multiplex 

5 block of the first embodiment. 

[0303] Only TS packets storing one type of elementary stream are stored to each multiplexing unit, and these TS 
packets are not mixed with TS packets storing another type of elementary stream. Mixing TS packets with Null packets 
is not prohibited because it may be necessary to include one or more Null packets in order to generate a multiplexing 
unit (such as the multiplexing unit storing the last part of a stream). This is necessary to clarify the relationship between 

10 multiplexing units and packs. 

[0304] One multiplexing unit contains eleven continuous TS packets, and the elementary stream (payload data) in 
each multiplexing unit is stored completely within the one corresponding pack. This likewise constrains the relationship 
to the pack. 

[0305] TheTS packet storing the PES packet header is placed at the beginning of the multiplexing unit. This correlates 
15 the packet header of the pack (the "PES packet header" in the MPEG.TS) and the PES packet header of the Con- 
strained SESF and enables easy conversion during sequential processing of each TS packet when converting to a 
DVD format pack. 

[0306] When the PES packet storing the video stream is segmented and placed in multiple multiplexing units, all 
multiplexing units other than the multiplexing unit containing the last byte of the PES packet store a TS packet payload 

20 data of 1 84 x 1 1 = 2024 bytes. This allows stream transfers to be completed most efficiently and successive processing 
by TS packet unit to be easily accomplished during TS2PS conversion. If the size of multiplexing units other than the 
last multiplexing unit is less than 2024 bytes, it will not be possible to easily determine the value of the 
PES„packet_length field stored to the packet header of each pack in the MPEG_PS when converting the first TS 
packet of the multiplexing unit during TS2PS conversion. 

25 [0307] A PES packet storing an audio stream starts in the first TS packet in one multiplexing unit and ends in the 
same multiplexing unit. This is easy to understand by considering storing PES packets storing the audio stream to 
multiple multiplexing units. If one audio PES packet is segmented and placed in multiple multiplexing units, it is nec- 
essary to identify the PTS and determine the number of audio frames stored in one pack in order to generate the packet 
header when converting the second and later multiplexing units to MPEG_PS packs. In order to do this it is then 

30 necessary to analyze the internal structure of the audio stream. 

[0308] The multiplexing unit is defined above. Encoding to generate a Constrained SESF involves system encoding 
within the constraints of the above-described multiplexing unit. 

< Constraints of PES packet headers in a Constrained SESF > 

35 

[0309] A number of constraints on the field values of the PES packet header in a Constrained SESF are described 
next. 

[0310] As shown in Fig. 43, some PES packet header fields allow only fixed values. This is to prevent creating 
unnecessary processes when converting to different DVD formats. "Unnecessary processes" means processing fields 
40 additionally created or deleted by values different from values defined by the DVD format. In other words, the purpose 
of this PES packet header constraint is to minimize fields added to or deleted from the header during TS2PS conversion. 
[0311] A value of 0 is permitted for the PES_packet_length field when a video stream is stored to the MPEG_TS. 
[0312] The PTS_DTS_flags field indicates if a PTS or DTS is included. 

[0313] When the PES packet stores an audio frame, at least one or more audio frames starts in the PES packet, 
45 and PTS_DTS_flags is set to 1 0b (to 11 b if the DTS is written). 

[0314] Constraints for sequentially processing by TS packet unit during TS2PS conversion are applied to 
PES_extension_flag and PES_header_data_length. These are shown in Fig. 44. 

[031 5] As shown in Fig. 44, specific values are defined according to the elementary stream type : PES packet location, 
and encode_condition value. 

50 [031 6] Note that VPD in Fig. 44 is the combined byte length of the PTS field and DTS field in the PES packet. That is, 

if PTS_DTS_flags = 00b, VPD = 0; 
if PTS_DTS_flags = 10b, VPD = 5; 
if PTS_DTS_flags = 11b, VPD = 10. 

55 

[0317] As described above, this constraint is necessary to simplify sequential processing by TS packet unit without 
forming the packs after determining the payload length of one pack when converting to the DVD Video or VR format. 
[0318] The PES packet header is defined above. An encoder generating the Constrained SESF encodes the system 
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stream within the above constraints. 

< Constraints on the Tip packet insertion interval > 

[0319] Constraints on the insertion interval of Tip packets inserted to a Constrained SESF are described next. 
[0320] The following relationship must be true for the decoder input time indicated by ATS (ATS1 ) of the Tip packet 
and the decoder input time indicated by ATS (ATS2) of the TS packet storing the video or audio stream input to the 
decoder first after the Tip packet. 

ATS1 + T< ATS2 
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T = (PS_pack_size*8*system_clock__frequency) / PSrate 

where T is the shortest PS pack transfer period. This shortest transfer period is the shortest period from the start to 
the end of PS pack input to the system decoder. That is, the above equation shows that the ATS interval of each TS 
packet must at least be greater than the interval at which PS packs after conversion can be input to the system decoder 
[0321] The value of T is determined as follows. 

[0322] PS_pack_size is the byte length of one pack in the MPEG_PS generated by TS2PS conversion, the 
system_clock_f requency is the frequency of the reference clock of the M PEG_PS decoder, and PSrate is the multiplex 
rate of the MPEG_PS stream generated by TS2PS conversion. 

[0323] These values are defined as below by the DVD format, and the relationship between ATS1 and ATS2 is 
therefore as follows. 



PS_pack_size = 2048 bytes 
30 system_clock_frequency = 27,000,000 Hz 

PSrate = 10,080,000 bits/second 

35 

ATS1 + 43885.714... < ATS2 

Therefore, 

ATS1 + 43886 = ATS2 

defines the minimum value of ATS2. The TS2PS conversion described below typically converts a Tip packet to a 2-KB 
NV_PCK (in DVD Video conversion) or RDI.PCK (in DVD VR conversion) pack. However, if the above relationship is 
not satisfied, the next elementary stream is transferred earlier and may exceed the upper limit of the DVD system 
transfer rate (1 0.08 Mbps). 

[0324] It should be noted that while these effects are obtained by providing a period in which AV data is not sent 
only after Tip packet transfer they can also be achieved by providing the above time interval between AV data transfers 
before and after a Tip packet. 

[0325] An integer number of GOPs are aligned in one SESF capsule. This is to make the SESF capsule correlate 
to a VOBU of the DVD format so that the VOBU concept of the DVD format can also be realized in the Constrained 
SESF. More particularly, a VOBU must include an integer number of GOPs according to the DVD format (DVD VR) 
[0326] Video data stored in one SESF capsule must be at least 0.4 second and not more than 1 .0 second wide on 
the playback time base. In addition, the time width on the playback time base of video data stored in the last SESF 
capsule is greater than or equal to 0.4 second and less than or equal to 1 .2 second when the encode_condition = 11 b 
(DVD Video mode), and when the encode_condition = 01 b (DVD VR mode) must be less than or equal to 1 .0 second 
This is because the SESF capsule becomes a VOBU, and must conform to the specific DVD format. 
[0327] Each Tip packet normally preferably has a 1 :1 correlation on the access map used for time-address conver- 



24 

BNSDOCID: <EP 1 42271 OA 1 I > 



EP1 422 710 A1 



sion. This is required so that conversion can start immediately with the VOBU units defined by the DVD format during 
TS2PS conversion, and so that the DSI (Data Search Information) (which provides address information for the adjacent 
VOBU stored in the NV_PCK) can be generated from the access map when Tip packets are converted to NV_PCK 
packs during conversion to the DVD Video format. The DSI can be calculated insofar as the access map stores the 
5 playback time (part or all of the AV playback time information immediately after the Tip packet according to FVFPST) 
for each Tip packet and recording address of each Tip packet, and the number of multiplexing units stored between 
two consecutive Tip packets is known. This is achieved by imposing the following constraints. 

[0328] It should be noted that all Tip packets need not be pointed to from the access map. For example : AV data 
following the last Tip packet in a Constrained SESF does not contain playback time length information nor have a next 
10 Tip packet, is thus different from other Tip packets and is therefore handled differently. In this case, no particular adverse 
affect on playback and conversion even if the last Tip packet is not registered in the access map, and thus it can be 
handled in an exceptions process in consideration with the device implementation. 

[0329] A total thirty-two packets not associated with a multiplexing unit are inserted between two consecutive Tip 
packets. This is because it is necessary to determine how many packs there will be in a VOBU when converted to a 
15 DVD format using the access map during TS2PS conversion. (The number of packets need not be limited to 32, but 
there must be some specific number of packets. Because the number of TS packets following a Tip packet can be 
determined from address information of the Tip packet in the access map, the number of packs included in a VOBU 
when converted to a DVD format can be determined if the number of packets that are not multiplexing units is known. 
This is important.) 

20 [0330] Furthermore, the reason there are 32 packets is as follows. It could be sufficient that there are at least 31 
PAT, PMT, PCR, and SIT packets between two consecutive Tip packets, because: the PAT, PMT packets describing 
the MPEG_TS program configuration data must be embedded at least once every 100 msec; a SIT packet storing 
specific information for each program must be embedded at least once every 1 second; the PCR packet storing the 
PCR (program clock reference) establishing the decoder reference time must be embedded at least once every 100 

25 msec; Null packets not belonging to any multiplexing unit can be freely added; and the Tip packet insertion interval is 
1 .0 second or less on the AV data playback time base. Therefore, count of the VOBU pack can be determined from 
the access map by inserting PAT, PMT, PCR, and SIT packets between two consecutive Tip packets according to these 
defined times, and adding Null packets until there are 32 packets. 

[0331] Consider, for example, the number of packs after conversion when a Tip packet is inserted at 0.5 second 
30 intervals and there are 121 0 TS packets following a Tip packet identifiable from the access map. In this case there is 
a total of 15 (=5+5+5) PAT, PMT, and PCR packets, 1 SIT packet inserted after this Tip packet, and 16 Null packets 
inserted to achieve a total 32 packets. When this is then converted to DVD format, the Tip packet is converted to an 
NV_PCK (when converted to DVD-Video) or RDI_PCK (when converted to DVD VR) as one pack, and one multiplexing 
unit (11 TS packets) is converted to one pack, respectively The count of VOBU pack can therefore be denoted as 
35 1 + (number of multiplexing units). 

The number of multiplexing units is 

(number of TS packets following that Tip packet - 33)/11 . 
In this example, therefore, there are 

1 + ((1 21 0-33)/1 1 ) = 1 +1 07 = 1 08. 

It thus can be determined that the VOBU has a total 108 packs. If the number of packs in each VOBU and the pres- 
entation start time information is known, the DSI packet of the N V_PCK required for conversion to DVD Video can be 
45 generated very quickly. 

[0332] The constraints on the Tip packet insertion interval are as described above. The encoder generating the 
Constrained SESF encodes the system stream within the above constraints. 

< Constraints on decoder control > 

50 

[0333] Constraints on decoder control (buffer management) of Constrained SESF are described next below. 
[0334] A Constrained SESF must be generated to satisfy the standard of T_STD that is the decoder reference model 
for an MPEG_TS. This means that the Constrained SESF can be decoded by a set-top box, for example, having a 
T_STD conforming decoder if the stream types match. 
55 [0335] The MPEG_TS standard decoder model T_STD and the MPEG_PS standard decoder model P_STD are 
substantially the same in operation and processing capabilities, but the audio stream input rate to the decoder differs. 
More specifically, the transfer rate of the T_STD from the transport buffer before the audio decoder to the audio buffer 
is 2 Mbps (except for AAC) (see Fig. 18). The P_STD, however, inputs each stream to the decoder at the system rate, 
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which with DVD is 10.08 Mbps. 

[0336] This means buffer management of a Constrained SESF and DVD format cannot be the same. 
[0337] The same buffer management thus cannot be used with an MPEG_TS and MPEG_PS. However if the SCR 
(System Clock Reference) indicating the decoder input time of the pack after conversion can be calculated using the 
5 ATS added to each TS packet while avoiding system encoding with re-consideration of buffer management during the 
conversion from a Constrained SESF to DVD format, very quick and easy conversion can be achieved. Calculating 
the SCR using the ATS is described in detail further below. 

[0338] Furthermore, the Constrained SESF of the present invention must be encoded so as to assure that it conforms 
to the T_STD and also that the MPEG_PS generated by the conversion method described further below conforms to 
10 P_STD. 

[0339] More specifically, the Constrained SESF is a stream encoded to a MPEG_TS so that it also conforms to the 
P_STD after conversion to an MPEG_PS. 

[0340] These are the constraints on Constrained SESF buffer management. It should be noted that a normal SESF 
is simply encoded to conform to the T_STD without considering these constraints. 
15 [0341] Examples of MPEG_TS and MPEG_PS that do not conform to the standard T_STD and P_STD models are 
described below. 

[0342] First, an example of a MPEG_TS self-encoded such that it can be converted to an MPEG_PS but does not 
conform to the T_STD model is shown in Fig. 45. Stream TS1 is an MPEG transport stream applied with system- 
encoding to conform with the T_STD model. Stream TS2 is an MPEG transport stream that does not conform to the 

20 T_STD model. More specifically, in the stream TS2, the values of ATS [47] to ATS [57] are set above the transfer rate 
allowed for MPEG_TS audio data. The audio transport buffer thus overflows (Fig. 1 8) and the stream does not conform 
to the T_STD model. In stream TS1 , however, the values of ATS [47] to ATS [57] are set within the transfer rate allowed 
for MPEG_TS audio data. This stream can therefore be correctly converted to a P_STD conforming MPEG program 
stream PS1 using the SCR conversion equation described below. Furthermore, while stream TS2 does not meet the 

25 T_STD standard, PS1 can be generated by conversion using the below SCR conversion equation. For conversion 
from stream TS2 to MPEG_TS conforming with a T_STD, the audio packet transfer time interval specified by ATS [47] 
to ATS [57] must 'be increased so that a transport buffer overflow does not occur. 

[0343] Figs. 46A and 46B shows an example in which the T_STD model is satisfied but the MPEG_PS converted 
from an MPEG_TS does not satisfy the P_STD model. Stream TS3 is an MPEG transport stream, and stream PS3 is 

30 an MPEG program stream converted from MPEG transport stream TS3. Fig. 46B shows the change in the state of a 
buffer for video data for each stream during decoding. The PES #1 picture is decoded at time SCR [2], and the PES 
#2 picture is decoded between SCR [4] and SCR [5]. As shown in Fig. 46B, transfer of TS packet data in transport 
stream TS3 is completed by the time picture data in PES #1 and PES #2 is decoded. With program stream PS3, 
however, V_PCK #1 data transfer for PES #1 is in time, but transfer of V_PCK #4 for PES #2 data is late for decoding 

35 and a buffer underflow occurs because decoding starts while the data transfer is in progress. The requirements of the 
P - STD model are therefore not met. This can be avoided by setting the value of the ATS field (ATS [14], ATS [25], 
ATS [36]) for each TS packet converted to V_PCK #2 to V_PCK #4 temporally before the PES #2 picture decoding 
time so that transferring the MPEG_TS PES #2 picture data is completed earlier. 

«> < ATS-SCR conversion > 



[0344] Calculation method of the SCR of PS packet when converting a Constrained SESF stream to a program 
stream is described below. The SCR must be calculated in order to generate a new pack, and is therefore necessary 
only when converting Tip packets and the first TS packet in a multiplexing unit. 

[0345] The structure of a Constrained SESF stream is as shown in Fig. 14C. A PCR packet storing reference time 
information (program clock reference PCR) and/or a Tip packet is appropriately inserted to aTS packet, and this can 
be used to reset the decoder reference time STC (system time clock) at an appropriate time interval. Each TS packet 
also contains an ATS storing the relative transfer time information between each TS packet. Therefore, TS packets 
output after the TS packet storing the PCR are input to the decoder at a timing determined from the PCR and ATS 
indicating the relative transfer time between TS packets. In other words, the decoder input time (the "calculated_PCR" 
below) of each TS packet can be generated forTS packets from the TS packet storing the PCR. If no TS packet stores 
the PCR, information equivalent to the PCR can be extracted to the management information. 
[0346] Fig. 47 shows the relationship between the calculated_PCR and SCR when converting a Constrained SESF 
to MPEG_PS, i.e., a head of the SESF capsule shown in Fig. 80. The ATS assigned to each TS packet in ascending 
order from the stream start is denoted ATS [k]. The PCR calculated in order of appearance for the first TS packet in a 
multiplexing unit is denoted calculated_PCR [i] (i = 0, 1 , 2...). The SCR of the pack after conversion is likewise denoted 
SCR[i]. 

[0347] As noted above, video stream transfers are constrained by the maximum transfer rate of 1 5 Mbps (in the case 
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of MP@ML, the transfer rate from the multiplexer buffer to the video buffer does not exceed 15 Mbps) and the audio 
stream input rate is lower than the video transfer rate. (Except for AAC, the transfer rate from the transport buffer to 
the audio buffer does not exceed 2 Mbps.) Multiplexing units storing audio data are therefore different from multiplexing 
units storing video data and are transferred at a lower rate. Therefore, if the video data transfer rate is raised to near 
5 the 9.8 Mbps maximum rate of the DVD format, TS packets of video data must be transferred at a rate above the DVD 
transfer rate (10.08 Mbps) in order to assure sufficient time for audio data transfers which occur at a lower rate and 
therefore take more time. 

[0348] That the transfer time differs in Constrained SESF and the DVD format will be known from Fig. 47. 
[0349] The following relationship must be true for the decoder arrival time (calculated_PCR) of the first TS packet 
10 in a multiplexing unit or Tip packet, and the SCR of the pack after that packet is converted. 

SCR[0] = calculated_PCR[0] 

SCR[i] = max(SCR[i-1] + T, calculated_PCR[i]) (i= 1 ,2, 3, ...) 
calculated_PCR[i] = PCR_tip + (ATS[i] - ATS_tip + WA*BS) 
15 T = PS_pack_size*8*system_clock_frequency / PSrate 

where PCR_tip and ATS_tip are the PCR value and the ATS of the Tip packet immediately before the converted mul- 
tiplexing unit. WA indicates how many times overflow occurred (described further below) in a range between ATS_tip 
and the ATS (ATS [i]) assigned to the first TS packet in the i-th multiplexing unit. BS denotes the amount of one overflow 
20 in ATS. max(a.b) is a function for selecting the greater of a or b. 

[0350] In the SCR [i] (i = 0, 1, 2, 3, ...) relation, PS_pack_size is the byte length of one pack in the MPEG_PS 
generated by the TS2PS conversion as noted above, system clock_frequency is the frequency of the MPEG_PS de- 
coder reference clock, and PSrate is the multiplex rate of the MPEG_PS generated by TS2PS conversion. That is, 

25 PS_pack_size = 2048 bytes 

system_clock_frequency = 27,000,000 Hz 
PSrate = 10,080,000 bits/second 

[0351] There are, therefore, two patterns for transferring packs after the first pack: transferring the pack after a 
30 minimum transfer time determined by the transfer rate passes from the transfer time of the one preceding pack, or 
transferring the pack at the decoder input time of the first TS packet in the pack. For pack transfers at a time before 
the time when the video data is converted to the DVD format, packs are transferred at the minimum transfer time 
interval noted above. For example, when packs are transferred in a time band preceding video data conversion to the 
DVD format, the former method of transferring packs after waiting the minimum transfer time determined by the transfer 
35 rate from the time when the preceding pack was transferred is selected. 

[0352] It should be noted that because a Constrained SESF can be edited, the calculated_PCR [0] will not go to 0 
even when encode_condition = 11b when the beginning of the stream is deleted by editing, for example, and it is 
possible to reset encode_condition to "00b" . 

[0353] However, if calculated_PCR [O] is permitted to be set to a non-zero state while encode_condition = 11b, this 
40 problem can be resolved by applying the following conversion equation only when encode_condition = 11b. 

SCR[0] = 0 

SCR[i] = max( SCR[i-1] + T, calculated_PCR[i] ) - calculated_PCR[0] (i= 1 ,2, 3, ...) 
calculated_PCR[i] = PCRJip +- (ATS[i] - ATS_tip + WA*BS) 
45 t = PS_pack_size*8*system_clock_frequency / PSrate 

PTS(DVD-Video) = PTS(Constrained SESF) - calculated_PCR[0] 
DTS(DVD-Video) = DTS (Constrained SESF) - calculated_PCR[0] 

[0354] In other words, to conform to the DVD Video format, SCR [0] is set to 0, values of subsequent SCRs are offset 
so values, and all PTS and DTS in the DVD Video stream are offset by a uniform time of calculated_PCR [0] using the 
result of the above conversion equation offset time calculated_PCR [0]. 

[0355] By thus uniformly offsetting the time information of the stream, conversion to the DVD Video format while 
keeping encode_condition = 11b is possible even when the beginning of the Constrained SESF (encode_condition = 
11b) has been deleted. 

55 [0356] PTS and DTS values may be converted during conversion to the DVD Video format, but this can be easily 
achieved by sequentially processing the TS packet units. 

[0357] The SCR is calculated from the ATS based on the above equation during TS2PS conversion. The program 
stream output by TS2PS conversion must conform to the P_STD model as noted above, and this means that SCR 
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values are restricted to a particular range. The ATS values assigned to each packet of a Constrained SESF must 
therefore be set according to the ATS-SCR relation shown above. 

< Elementary stream constraints > 

[0358] Constraints on the elementary streams of a Constrained SESF are described next. 

[0359] Because re-encoding the elementary streams imposes an extremely heavy burden on the encoder, only 
MPEG-2 Video is allowed for video data while AC-3, MPEG-1 Audio, and LPCM are allowed for audio data. 
[0360] The Constrained SESF described here excludes LPCM, however. This is to avoid the danger of needing to 
re-encode the elementary stream when LPCM uses a quantization rate of 20 bits or more, and to simplify buffer man- 
agement by reducing the amount of audio data for which the transfer rate cannot be raised. If 16-bit LPCM is used, 
however, there is no particular need to exclude LPCM audio. 

[0361] Streams permitted for the Constrained SESF described here are MPEG-2 Video for the video data and two 
types of audio data, AC-3 and MPEG-1 Audio. 

[0362] In normal SESF which is not Constrained SESF, encoding of audio data is not limited to the above. Encoding 
method such as AAC (Advanced Audio Coding) which is used in BS digital broadcasting can be used. 
[0363] Elementary stream attributes when encode_condition = 11b are shown in Fig. 48. 

[0364] Because the attributes shown in the figure are set to assure interchangeability at the elementary stream level 
between DVD Video and DVD VR, a Constrained SESF (encode.condition = 11b) conforming to these attributes does 
not require elementary stream re-encoding when converted to DVD Video or DVD VR formats, and high speed con- 
version is therefore possible. 

[0365] Elementary stream attributes when encode^condition = 01b are shown in Fig. 49. 

[0366] Because the attributes shown in the figure are set to assure interchangeability at the elementary stream level 
with DVD VR, a Constrained SESF (encode_condition = 01b) conforming to these attributes does not require elemen- 
tary stream re-encoding when converted to DVD VR format, and high speed conversion is therefore possible. 
[0367] Notes 1 to 4 in Fig. 48 and Fig. 49 are described below. 

Note 1 : This attribute cannot change within the same VOB. 

Note 2: This attribute can change in the TS packet storing the first elementary stream following the Tip packet. In 
other words, it can change only in the first video or audio TS packet in an SESF capsule. 

Note 3: sequence_end_code cannot be inserted between sequence_header fields having the same 
horizontaLsize, vertical_size, and aspect_ratio_information. 
Note 4: This attribute can change within the same VOB. 

[0368] Constraints relating to the elementary streams of a Constrained SESF are described above. 

[0369] Adding the encoding conditions described above makes it possible to generate a Constrained SESF that can 

be easily and quickly converted to a DVD format. 

< DVD Video and DVD VR format after conversion > 

[0370] The field settings of the DVD Video and DVD VR formats to which the Constrained SESF is to be converted 
are described next. 



<DVD Video format> 

[0371 ] A stream conforming to the DVD Video standard is described briefly first below. The DVD Video stream format 
is described in detail in "DVD Specifications for Read-Only Disc, Part 3, VIDEO SPECIFICATIONS." 
[0372] The stream structure of the DVD Video format is shown in Fig. 50. As shown in the figure each stream contains 
multiple VOBs and each VOB contains an integer number of VOBU. A VOBU includes an integer number of packs 
starting with a NV pack (V_PCK) followed by a video pack (V_PCK) an audio pack (A_PCK). Unlike a normal DVD 
pack, the NV_PCK contains two packets. These packets are called the PCI (Presentation Control Information) and 
DSI (Data Search Information) packets, respectively The playback control information for the corresponding VOBU is 
stored to the PCI packet. Information useful for special playback modes, such as the relative positions of the VOBU 
to neighboring VOBU, is stored to the DSI packet. These fields are described next below in conjunction with how the 
field values are determined. 

[0373] Fig. 51 shows the structure of NV_PCK PCI data. The PCI data includes PCLGI (PCI General Information) 
storing general information for PCI, NSML_AGLI as angle information for non-seamless presentation, HLI as informa- 
tion for adding highlighting to menus and buttons, and RECI storing the International Standard Recording Code (ISRC). 
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[0374] When converted from Constrained SESF, NSML_AGLI and HLI store a value indicating an invalid. 
[0375] The ISRC field can store a value indicating an invalid or a ISRC code as it is, but this field is irrelevant to 
conversion from Constrained SESF and further description is therefore omitted. The only field that a problematic with 
respect to creating PCI data from a Constrained SESF is therefore the PCIJ3I field. 
5 [0376] Fig. 52 shows the structure of the PCI_GI field in NV_PCK. Note that calculation methods are described below 
only for those fields that must be calculated during conversion from a Constrained SESF. 
11/17 

[0377] NV_PCK_LBN (the relative address of NV_PCK in the VOBS file) can be determined by the data recording 
apparatus which counts each pack number during conversion. 
10 [0378] VOBU_CAT (information of the analog copy protection state) can be obtained from the display_and_copy_info 
of the Tip packet corresponding to the NV_PCK. 

[0379] VOBU_S_PTM (presentation time information for the video field presented first in the VOBU) can be calculated 
from the FVFPST of the Tip packet corresponding to the NV_PCK. 

[0380] VOBU_E_PTM (time information when presentation of video data in the VOBU ends) can be obtained from 
15 the presentation time information written to the next entry in the access map, or it can be generated by analyzing the 
video stream of the VOBU and calculating the time at which video presentation ends. 

[0381] VOBU_SE_E_PTM (time information when presentation of video data in the VOBU ends according to the 
sequence_end_code field) is filled with "OxOOOOOOOO" in all VOBUs before the last VOBU, because the 
sequence_end_code is only permitted in the last VOBU and the middle VOBU therefore do not contain the 

20 sequence_end_code. VOBU_SE_E_PTM is set to the same value as in VOBU_E_PTM in the last NV_PCK only. 

[0382] C_ELTM is the time difference between the presentation time of the first video frame presented in a cell storing 
NV_PCK and the video frame first presented in the VOBU, and must be expressed with frame precision. C_ELTM can 
be calculated as needed by the data recording apparatus during the conversion process using the FVFPST of the 
corresponding Tip packet and the presentation time information of the video frame presented at the beginning of a CELL . 

25 [0383] NV_PCK PCI data can thus be generated as needed by VOBU unit during conversion as described above. 
[0384] Fig. 53 shows the DSI structure of the NV_PCK. As shown in Fig. 53 the DSI data field includes: DSI J3I (Data 
Search Information General Information) storing general DSI information; SML_PBI (Seamless Playback Information) 
storing recording address and playback information needed for seamless presentation between VOBs; SML_AGLI 
(Angle Information for seamless) storing location information needed for seamless presentation between different an- 

30 gies and so on; VOBU_SRI (VOB Unit Search Information) storing the recording address of VOBU adjacent to a par- 
ticular VOBU; and SYNCI (Synchronous Information) enabling synchronous presentation of video with audio/subpic- 
tures. 

[0385] When converted from a Constrained SESF, SML_AGLI stores information indicating invalid. 
[0386] Fig. 54 shows the DSI_GI structure of an NV_PCK. Note that calculation methods are described below only 
35 for those fields that must be calculated during conversion from a Constrained SESF. 

[0387] NV_PCK_SCR (the SCR of the NV_PCK) is deduced from the SCR deduced from the ATS of the Constrained 
SESF by the method described further below. 

[0388] NV_PCK_LBN (relative address of the NV_PCK in the VOBS file) is the same as the PCI data. 

[0389] VOBU_EA (relative address from the NV_PCK to the last pack in the VOBU) can be calculated from the 

40 access map. As described above, the number of packets not belonging to a multiplexing unit between two consecutive 
Tip packets is known (fixed). Therefore, the number of TS packets to the next entry (the next Tip packet) can be 
calculated from the access map. The number of TS packets in that TS packet not belonging to a multiplexing unit then 
subtracted, and the difference is then divided by 11 to determine the number of packs formed after the NV PCK. The 
number of packs generated after conversion can be written to the NV_PCK derived from the last Tip packet or to all 

45 NV_PCK. 

[0390] VOBU_1STREF_EA (relative address in the VOBU from NV_PCK to the last pack in the first referenced 
picture), VOBU_2NDREF_EA (relative address in the VOBU from NV_PCK to the last pack in the second referenced 
picture), and VOBU_3RDREF_EA (relative address in the VOBU from NV_PCK to the last pack in the third referenced 
picture) can be determined without analyzing to the video stream layer if the Tip packet PESJnfo field is referenced 
50 during TS2PS conversion. 

[0391] PESJnfo stores the picture_coding_type indicating the type of encoding applied to the picture stored in each 
video PES packet. A PES packet with a picture_coding_type of 01b or 10b stores a reference picture as defined in the 
DVD Video standard. 

[0392] It is therefore possible to reference the PESJnfo field during TS2PS conversion to determine if the PES 
55 packet being converted stores a reference picture, and the pack in which said converted PES packet ends becomes 
the last pack of the reference picture. 

[0393] Because the last pack of a reference picture can be identified during conversion, it is also possible while 
generating the VOBU to determine in which pack the first, second, and third reference pictures end, and write a relative 
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address to the end of each said picture in the VOBU_1STREF_EA, VOBU_2NDREF_EA, and VOBU_3RDREF_EA 
fields of the first NV_PCK in the VOBU. 

[0394] VOBU_VOBJDN (the ID number of the VOB to which the VOBU belongs) should be obtainable by the data 
recording apparatus during conversion. When one Constrained SESF is being converted, VOB segmentation due to 
5 the stream conditions, such as an attribute change, is prevented and the same ID number can be assigned by setting 
the Constrained SESF encode_condition to 11b. 

[0395] Like VOBU_VOB_IDN, VOBU_C_IDN (the ID number of the CELL to which the VOBU belongs) is set by' the 
data recording apparatus during conversion, and is not related to the stream. If the CELL is intentionally segmented 
based on the PGC information or other management information for the Constrained SESF, a number determined by 
io the segmentation is simply assigned. 

[0396] C_ELTM is the time difference between the presentation time of the first video frame presented in a cell storing 
NV_PCK and the video frame first presented in the VOBU, and must be expressed with frame precision. C_ELTM is 
the same as the C_ELTM written to the PCI data. 

[0397] Each field of the DSI_GI field in the NV_PCK can thus be continuously generated by VOBU unit during con- 
's version as described above. 

[0398] Fig. 55 shows the structure of the SML_PBI field in NV_PCK. Note that calculation methods are described 

below only for those fields that must be calculated during conversion from a Constrained SESF. 

[0399] VOB_V_S_PTM (presentation time of the first video frame presented in the VOB to which NV_PCK belongs) 

can be determined from the FVFPST of the first Tip packet. 
20 [0400] VOB_V_E_PTM (video presentation end time in the VOB to which NV_PCK belongs) can be determined by 

analyzing the stream after the last Tip packet in the part of the Constrained SESF selected for conversion before the 

actual TS2PS conversion. 

[0401] It is thus possible to calculate the fields of SML_PBI of NV_PCK before conversion. 

[0402] As noted above VOBU_SRI can be calculated using the access map, and further description thereof is thus 
25 omitted here. 

[0403] Furthermore, VOBU_SRI is written completely within each cell and thus cannot be determined if the cell is 
not defined. Therefore, a recorder that records in the DVD Video format in real-time cannot create cells at any desired 
interval and thus suffers from degraded editing and playback performance. When converting from a Constrained SESF, 
however, cells can be defined as periods specified by the user and converted using the method described above, 
30 chapters can be created as intended by the user, and a play list that starts playback from a user-defined point can be 
created conforming to the DVD Video format. 

[0404] Fig. 56 shows the structure of the SYNCI field of NV_PCK. Note that calculation methods are described below 
only for those fields that must be calculated during conversion from a Constrained SESF 

[0405] A_SYNCA0 is the relative address of a pack storing a primary audio pack and storing the audio frame pre- 
ss sented simultaneously to or immediately after VOBU_S_PTM. It can be determined using the PESJnfo in the Tip 
packet without analyzing the stream during TS2PS conversion. 

[0406] Whether a PES packet stores primary audio can be determined by reading the streamjdentifier of the 
PESJnfo, and at the next sync_presentati on_f lag it is possible to determine whether there is an audio frame presented 
simultaneously to or immediately after VOBU_S_PTM in the audio frame contained in the PES packet. Therefore, if 
*o the PES packet contains primary audio and the sync_presentation_flag = 1b, the address from the NV_PCK to the 
pack storing the PES packet can be written during TS2PS conversion. 

[0407] It should be noted that there is no guarantee that the sync_presentation_flag = 1 b will be set in one audio 
pack of the VOBU . If the encoder multiplexes the audio first, the audio pack presented simultaneously to or immediately 
after VOBU_S_PTM of the VOBU could be stored in the preceding or the following VOBU. 
45 [0408] The value set to the A_SYNCA0 field must therefore be determined during conversion with a correct under- 
standing of the sequential relationship between the PES packet of the primary audio (the sync_presentation_flag = 
1b) and the subsequently generated NV_PCK. 

[0409] To eliminate this process the Constrained SESF can be system encoded so that the audio data presented 
simultaneously to or just after the FVFPST written to the first Tip packet in the SESF capsule is also stored in the same 
50 SESF capsule. 

[0410] By using these definitions a process for detecting audio data synchronized to VOBU_S_PTM (FVFPST) out- 
side the VOBU (SESF capsule) can be eliminated. 

[041 1 ] A_SYNCA1 is the relative address of a pack storing a secondary audio and storing the audio frame presented 
simultaneously to or immediately after VOBU_S_PTM, and can be determined using the same method as A_SYNCA0. 
55 [0412] Except for A_SYNCA, it is thus possible to sequentially generate DSI data of NV_PCK by VOBU unit during 
conversion. 

[0413] An example of NV_PCK generation is as shown in Fig. 82. 
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<DVD Video Recording format> 

[0414] Field settings during conversion to the DVD Video Recording (VR) stream format are described next below. 
[0415] The DVD VR stream is described briefly below. Note that the DVD VR stream format is described in detail in 

5 "DVD Specifications for Rewritable/Re-recordable Discs, Part 3, VIDEO RECORDING." 

[0416] Fig. 57 shows the stream structure of the DVD VR format. As shown here each stream includes multiple 
VOBs, and each VOB contains an integer number of VOBUs. A VOBU includes an integer number of packs, starting 
with an RDI_PCK followed by a video pack (V_PCK) and an audio pack (A_PCK). Unlike a normal pack, the RDI_PCK 
contains presentation and copy control information, and manufacturer-specific information. The fields contained in the 

10 RDI_PCK are described below together with how the field values are determined. 

[0417] As shown in the figure, the RDI_PCK payload data (RDI Unit) includes: RDl_GI (Real-time Data Information 
Genera! Information) storing general information of RDI, DCI_CCI (Display Control Information and Copy Control In- 
formation) storing information used for display and copy control, and MNFI (Manufacturer's Information) storing man- 
ufacturer-specific information. 

15 [0418] The RDI_GI field contains a VOBU_S_PTM field. Only this field is variable and the other field values are fixed. 

[0419] VOBU_S_PTM has the same format as the FVFPST written to the corresponding Tip packet in the transport 

stream before conversion, and the FVFPST value can therefore be simply copied to the VOBU_S_PTM field. 

[0420] DCLCCI has the same format as the display_and_copy_info field of the Tip packet, and the 

display_and_copy_info value can therefore be simply copied to the DCLCCI field. 
20 [0421] A specific manufacturer ID is allocated only when the makerJD written to the Tip packet is identical to the 

manufacturer ID of the data recording apparatus, and the manufacturer-specific information is copied to the MNFI field. 

However, if the makerJD in the Tip packet is the ID for a different manufacturer, or the makerJD is invalid, the RDI 

pack can be generated by writing invalid data to the MNFI field. 

[0422] It is possible that part of the data written to the Tip packet is invalid. In this case a flag (an invalidation flag) 
25 indicating that there is invalid data in the Tip packet should be set. If this invalidation flag is set to ON, the flag must 
be updated after updating the invalid data in the Tip packet to the most recent data. 

[0423] As an example of this, considered can be a case where the most recent CCI data and a TS packet CCI data 
invalidation flag are present in the ATS (4B) of each TS packet. 

[0424] In this case it is necessary to determine if the invalidation flag is set during TS2PS conversion. If it is, it is 
30 necessary to convert to an RDI_PCK using data updating the CCI data in the display_and_copy_info field with the CCI 
flag of the ATS. 

[0425] RDI_PCK can thus be sequentially generated using only the corresponding Tip packet (and ATS thereof). 
[0426] Fig. 58 is a flow chart of the above RDI_PCK generation process. 

[0427] In a RDl_PCK (or NV_PCK), the system header includes fixed-value fields. Details of the system header are 
35 shown in Fig. 61 . The packet header and private header stored to the RDI_PCK are shown in Figs. 62A and 62B, 
respectively. Because these headers include fixed-value fields as shown in the figures, they can be easily generated. 
[0428] Fig. 59 is a flow chart of a process for generating PS packs from TS packets (multiplexing unit) storing AV data. 
[0429] As shown in the figure, TS packets of a Constrained SESF storing AV data are converted using one multi- 
plexing unit as the unit of processing to 2KB packs of an MPEG_PS storing AV data. This is further described below 
40 following the steps of this process. 

[0430] (Step S4200): One TS packet is read from the conversion starting point of the Constrained SESF stream. 
[0431] (Step S4201): Whether the read TS packet stores AV data and is the first TS packet in a multiplexing unit is 
determined. 

[0432] Whether AV data is stored is determined by reading the PID value of the TS packet declared in the PMT to 
45 store AV data. The TS packet is determined to be at the beginning of a multiplexing unit when the preceding TS packet 

'is a Tip packet, PSI/SI packet, or PCR packet and the TS packet following immediately thereafter stores AV data. 

Because a Tip packet is expected at a conversion starting point, whether it is at the beginning of a multiplexing unit 

can be determined by sequentially reading the TS packet (that is, the first TS packet storing AV data immediately after 

a Tip packet is always at the beginning of a multiplexing unit). If the TS packet is determined to not be at the beginning 
50 of a multiplexing unit, or if conversion does not start from a Tip packet and the determination cannot be made, control 

loops back to step S4200 to read the next TS packet. Control moves to the next step after the beginning of a multiplexing 

unit is found. 

[0433] (Step S4202) : Using the ATS assigned to the TS packet at the beginning of the multiplexing unit, the time 
(the PCR) at which the MPEG_PS pack to which the TS packet is converted will be input to the decoder is calculated. 
55 The PCR is calculated as described above. Once the PCR is calculated the SCR can be determined by the method 
described above, and the pack header shown in Fig. 60 is completed. This is because except for the SCR only fixed 
values are permitted in the pack header. 

[0434] (Step S4203): The packet header and private header are determined. 



31 



BNSDCCID: <EP 1 42271 OA 1J_> 



EP1 422 710 A1 

[0435] The packet header is created based on the PES packet header of the Constrained SESF. The form of the 
packet header must satisfy the field values shown in Fig. 63. This is because conversion from the Constrained SESF 
will not be determined uniformly if field values that will change the header length are not set, and buffer management 
may be affected. Field not shown here are fixed values and are therefore not listed. 
5 [0436] Individual field values of the PES packet header are determined specifically with the Constrained SESF in 
order to minimize the processing required for PES packet header (MPEG_TS) to packet header (MPEG_PS) conver- 
sion. 

[0437] If the PES packet is large relative compared to the size of one pack, one PES packet will be converted to 
plural packs. In this case major revisions to the packet headers of the second and subsequent packs include setting 
10 PTS_DTS_flags in the first packet header generated from the PES packet to 00b, the PES_extension_flag to Ob, 
adjusting the stuff in g_byte length, and correcting PES_header_data_length. 

[0438] The private header is required when a non-MPEG stream is stored, and is therefore required in packs storing 
NV_PCK, RDI_PCK, AC-3, or LPCM. 

[0439] Fig. 64 shows the private header of an AC-3. Of the fields shown in the figure, only the 
15 number_of_frame_headers field requires calculating during TS2PS conversion according to the Constrained SESF 
multiplexing unit definition. Because this field specifies the number of AC-3 audio frames stored in the pack, the field 
value can be easily calculated from the PES_packet_length for fixed-rate AC-3, for example, because the byte length 
of one audio frame is calculable from the bit rate and the value is a fixed length. 

[0440] It should be noted that the PES_header_data_length of the PES packet header of a Constrained SESF is 
20 stuffed with an extra 4 bytes by the AC-3 private header (4 bytes). (See Fig. 44.) By thus estimating before conversion 
the header length after conversion and shifting the payload position, sequential process in units of TS packet can be 
easily done. 

[0441] As described above, the first packet header is generated by correcting a part of the PES packet header, the 
second and later packet headers is generated by correcting a part of the first packet header, and the private header 
25 is inserted only when AC-3 audio data is stored. The packet header and private header can thus be generated. 

[0442] (Step S4204): Once the private header is generated, the PS pack payload is filled from the beginning thereof 
by simply copying data from the TS packet payload. 

[0443] (Steps S4205 to S4207): These steps simply repeat until the multiplexing unit (1 1 TS packets) is completed. 
However, because it is possible that a Null packet has been inserted, TS packet payload data is copied while the Null 
30 packet PID (0x1 FFF) is detected. 

[0444] Preferably either only the last TS packet in a multiplexing unit has an adaptation field, or only the TS packet 
storing the last data of the PES packet stores the adaptation field has an adaptation field. This makes reading the 
payload data easier because all TS packets other than the last TS packet in the multiplexing unit stores at least 184 
bytes of payload data. 

35 [0445] (Step S4208): When all multiplexing unit payload data has been copied, the byte length of the resulting pack 
is calculated to confirm if a byte length is 2048 bytes. Pack generation ends if there are 2048 bytes. If the pack contains 
less than 2048 bytes, control steps to S4209. 

[0446] (Step S4209): If the pack does not contain 2048 bytes, a padding packet is added to the end of the payload 
to a total of 2048 bytes. 

40 [0447] The conversion process thus proceeds from a multiplexing unit storing AV data. This process repeats only if 
a multiplexing unit is detected until processing the part of the Constrained SESF selected for conversion ends. 
[0448] The result of the above conversion process applied to packs of different types is described next below. 

<Conversion to a video pack (V_PCK)> 

45 

[0449] Figs. 65A and 65B show the conversion from a Constrained SESF to MPEG_PS. As shown in Fig. 65A, one 
video PES packet is normally larger than 2 KB, and is therefore segmentedto plural multiplexing units and multiplexed 
to a Constrained SESF. 

[0450] Based on the Constrained SESF definition each multiplexing unit other than the last multiplexing unit in a 
50 video PES packet is filled with the greatest possible amount of video PES packet data. Every multiplexing unit other 
than the last multiplexing unit therefore stores 2024 bytes (= 184 x 11 bytes) of data. 

[0451] Using this definition makes it possible to predefine such fields as the PES_packet_length and stuff ing_byte 
of each pack at TS2PS conversion. 

[0452] The last multiplexing unit storing data for one video PES packet fills the remaining data capacity with the 
55 adaptation field and Null packets to form one complete multiplexing unit. 

[0453] As shown in Figs. 65A and 65B, the following three types of multiplexing units are used to form one video 
PES packet: the first multiplexing unit storing the first data in the PES packet (MU #1 in the figures); multiplexing units 
(MU #n where n = 2, 3, ... N-1) storing data in the middle of the PES packet; and the multiplexing unit (MU #N) storing 



32 



BNSDOCID: <EP 1422710A1_I_> 



EP 1 422 710 A1 



the last PES packet data. 

[0454] The structure of the packs corresponding to these multiplexing unit types in the MPEG_PS stream resulting 
from TS2PS conversion is shown in Fig. 65B. 

[0455] The pack converted from MU #1 always contains at least 1 0 bytes of empty space, and padding packets are 
5 therefore inserted at the end when the pack is generated. 

[0456] This is because the DVD format specifies that stuffing bytes (last field of the packet header) are added to a 
total 2048 bytes when there is a space of 7 bytes or less in the pack, and padding packets are added if the space is 
8 bytes or larger, 

[0457] One stuffing byte is added to the packs converted from MU #n to complete each pack. 
w [0458] The pack converted from MU #N normally has a space of 8 bytes or larger, and a padding packet is therefore 
inserted. 

<Conversion to an audio pack (A_PCK)> 

15 [0459] Figs. 66A and 66B shows conversion from Constrained SESF to MPEG_PS. As shown in Fig. 66A, one audio 
PES packet (storing one or more audio frames) is smaller than one multiplexing unit. 

[0460] Because one audio PES packet will fit in one multiplexing unit, complex conversion is not required as it is for 
a video PES packet. More specifically, packs to which padding packets are added should always be generated as 
shown in Fig. 66B. 

20 [0461] Furthermore, because PES_packet_length does not change with TS2PS conversion, only simple calculations 
are required for conversion. These include appropriately setting the stream_id when converting MPEG-1 Audio, and 
generating the AC-3 private header. 

[0462] As also shown in the figure, buffer management can be simplified by minimizing the audio data transfer time, 
which is the greatest factor complicating system encoding a Constrained SESF. 
25 [0463] Because video data and other PSI/SI packets cannot be transferred when audio multiplexing units are being 
transferred, the overall transfer rate drops (i.e., image quality drops), and as the transfer time increases the video data 
must be transferred that much earlier on the transport stream (thus complicating system encoding). The audio multi- 
plexing unit transfer time is therefore preferably as short as possible. 

[0464] In other words, transferring the audio multiplexing unit in a shorter time means increasing the audio transfer 
30 rate. This is connected to reducing the difference between the allowed audio input rates, which is a major difference 
between the T_STD and P_STD. A major benefit of this is to also simplify generating a Constrained SESF which must 
conform to both decoder models. 

[0465] Fig. 67 shows the audio bit rates allowed in a Constrained SESF and the maximum payload length stored to 
one audio PES packet when AC-3 and MPEG-1 Audio is stored at each bit rate. Because data longer than the shown 
35 byte lengths will not fit in one audio PES packet, padding packets are inserted. 

<TS2PS conversion process> 

[0466] The TS2PS conversion process is detailed next below with reference to flow charts of Fig. 68 to Fig. 79. 
40 [0467] Fig. 68 is a flow chart of the main TS2PS conversion process. This process starts when a user inputs a TS2PS 
conversion request. The data recording apparatus then seeks the SESF capsule from which conversion starts (S11) 
and determines if the SESF capsule to be processed is present (S1 2). If it is not, the process ends. If the SESF capsule 
is present, an initialization process (S13) and capsule unit process (S14) are run. 

[0468] The initialization process (S13) is described with reference to the flow chart in Fig. 69. This process sets and 

45 initializes the variables and other parameters used in the following process. 

[0469] Whether a Tip packet has been read is first determined (S21). If a Tip packet has not yet been read, a Tip 
packet is read (S22). The ATS value of the read Tip packet is then set to variable ATSTip (S23) , the PCR value of Tip 
packet is set to variable PCRTip (S24). Variable MU_num specifying the number of the multiplexing unit being proc- 
essed is set to 0 (S25), and variable WA indicating how many times an ATS overflow occurred is set to 0 (S26). 

50 [0470] The capsule unit process (S1 4) is described with reference to the flow chart in Fig. 70. This process starts by 
reading a TS packet (S31) and then detecting if the read TS packet is a Tip packet (S32). Processing ends if it is a Tip 
packet. If not a Tip packet, it is determined whether the read TS packet contains an audio packet or video packet (S33). 
If the read TS packet does not contain an audio packet or video packet, control loops back to step S31 , and TS packets 
are sequentially read until the read TS packet is an audio packet or video packet (S31 to S33 repeat). 

55 [0471] When the read packet is an audio or video TS packet, the next 1 0 TS packets are also read (S34). MU_num 
is then incremented (S35). The ATS value of the first TS packet in the multiplexing unit is stored to variable ATS 
[MU_num] (S36). The byte length of the payload data in the PES packet stored to the multiplexing unit is set to 
payloadjen (S37). The pack unit process is then run (S38). 
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[0472] As shown in the flow chart in Fig. 71 , the pack unit process includes an SCR calculation process (S41 ), pack 
header process (S42), packet header process (S43), payload process (S44), and padding packet process (S45). These 
processes are described below. 

[0473] The SCR calculation process is described with reference to the flow chart in Fig. 72. 

[0474] This process determines the SCR value of the pack. The first step is to determine whether the multiplexing 
unit is the first multiplexing unit in the SESF capsule by referencing variable MU_num (S51 ). If it is, the value of ATSTip 
is set to variable ATS[0] and the value of variable PCRTip is set to variable SCR[0] (S52-S53). 

[0475] ATS[MU_num] and ATS[MU_num-1] are then compared (S55). The ATS value of the first packet in the mul- 
tiplexing unit is stored to ATS[i], and this ATS value indicates the relative transfer timing referenced to a particular 
packet. Therefore, the ATS value of each subsequent packet is normally greater than the ATS value of the previous 
packet. However, because the ATS is generally limited to a finite value expressible with 30 bits, ATS overflow can 
occur. In this case the ATS value of a certain packet may be smaller than that of the preceding packet. Step S54 
monitors this reversal of ATS values to determine when an ATS overflow occurs. If ATS [MU_num] is not greater than 
ATS[MU_num-1], that is, if an ATS overflow occurred, variable WA is incremented (S55). 

[0476] The greater one of SCR[MU_num-1] + T and (PCRTIP + ATS[MU_num] - ATSTip + WAx BS) is then set to 
SCR[MU_num] (S56). 

[0477] The pack header process is described with reference to the flow chart in Fig. 73. 

[0478] This process edits the pack header data in the data structure shown in Fig. 60. The remainder of the SCR 
divided by 300 is first inserted to SCR_extension (S61), and the quotient is set to SCR_base (S62). The 
program_mux_rate is set to "0x6270" (S63), and pack_stuffingjength to "000b" (S64). Other fields are edited appro- 
priately to complete the pack header data (S65). 

[0479] The packet header process is described with reference to the flow chart in Fig. 74. 

[0480] This process starts by running a stream ID process for setting the stream ID (S71 ). Whetherthe first TS packet 
of the multiplexing unit contains a PES packet header is then determined (S72). If the first TS packet of the multiplexing 
unit contains a PES packet header a PES packet leading process is run (S73), and a non-PES packet leading process 
is otherwise run (S74). Whether the first TS packet of the multiplexing unit contains a PES packet header can be 
determined by reading the payload_unit_start_indicator field of the TS packet header, or by directly detecting if a PES 
packet header start code is stored. 

[0481] The stream ID process is described with reference to the flow chart in Fig. 75. 

[0482] This process sets the value of the stream_id field. If the type of the stream being processed is "MPEG-2 
Video", the stream_id is set to "OxEO" (S81, S82). If the stream type is n AC-3 audio", the stream_id is set to "OxBD" 
(S83, S84). If the stream type is "MPEG-1 Audio" and "primary audio", the stream Jd is set to "OxCO" (S85, SB6, S87). 
If the stream type is "MPEG-1 Audio" and "secondary audio", the strearnjd is set to "0xC1 " (S85, S88, S89). 
[0483] The PES packet leading process is described with reference to the flow chart in Fig. 76. 
[0484] The structure of a PES packet according to the MPEG standard is shown in detail in Fig. 81 . This process 
edits the PES packet fields according to the structure shown in Fig. 81 . 

[0485] Whetherthe stream type is "MPEG-2 Video" is determined first (S91 ). If it is, PES_packet_length is set to the 
value determined by the following equation (S92). 

[0486] PES_packet_length = (3 + PES_header_dataJength) + payloadjen 

[0487] The 3 bytes from "10" to PES_header_data_length for each field of the TS packet before conversion (see 
Fig. 81) are copied directly to the corresponding fields in the packet header of the MPEG_PS pack after conversion 
(S93), Whether the PTS is present is then determined by referencing PTS_DTS_flags in the TS packet before conver- 
sion (S94). If the PTS is present, it is copied directly to the corresponding field in the packet header of the PS pack 
after conversion (S95). DTS presence is similarly determined by referencing PTS_DTS_flags in the TS packet before 
conversion (S96), and if the DTS is detected the DTS value is copied directly to the corresponding field in the packet 
header of the PS pack after, conversion (S97). Whether PES_extensionJlag is set to 1 is determined (S98), and the 
steps described below are run if PES_extension_flag = 1 . 

[0488] The stream type is detected and the 3 bytes from PES_private_data_flag to P_STD_buffer_flag are overwrit- 
ten with predetermined values according to the stream type. That is, if the stream type is "MP EG2- Video" (S99), the 
3 bytes from PES_private_data__flag to P_STD_buffer_flag are overwritten with the value "0x1 E60E8" (S1 00). If the 
stream type is "AC-3 Audio" (S101), it is overwritten with M 0x1E603A n (S102). If the stream type is "MPEG1 -Audio" 
(S103), it is overwritten with "0x1 E4020" (S104). 

[0489] The non-PES packet leading process is described with reference to the flow chart in Fig. 77. 
[0490] The 2 bytes from "10" to PES_extension_flag in the PES packet are set to "0x8000" (S111). Whether 
payloadjen is less than 2018 is then detected (S112). The payloadjen is the PES packet data length in one multi- 
plexing unit, and the maximum allowable value is therefore 184 x 11 = 2024 bytes. If payloadjen is less than 2018, 
PES_header_dataJength is set to 0 (S1 1 3). If payloadjen is 201 8 or greater, PES_header_dataJength is set to (2025 
- payloadjen) (S114), and payloadjen is stuffed the byte length of PES_header_dataJength (S115). 
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PES_packet_length is set to the value determined by the following equation (S116). 

PES_packetJength = (3 + PES_header_data_length) + payloadjen 

5 

[0491] The payload process is described with reference to the flow chart in Fig. 78. 

[0492] Variable i is set first (S121) , and the payload data of the PES packet stored to the i-th TS packet is read 

(5122) . The payload data of the PES packet stored to the i-th TS packet is then added to the payload of the pack 

(5123) and variable i is incremented (S124). These steps repeat until variable i is greater than 12 (S125). That is, this 
10 process repeats until all TS packets contained in one multiplexing unit are processed (S122 to S125). 

[0493] The padding packet process is described with reference to the flow chart in Fig. 79. 

[0494] Whether the PES_packet_length is set to 2028 is determined (S131). If PES_packetJength is not 2028, 
PES_packet_length of the padding packet is set to ((2028 - PES_packet_length) - 6] (S132), and padding packets are 
added after the payload (S133). 

15 [0495] It should be noted that the TS packet storing the video PES packet header is described above as being placed 
at the beginning of the video multiplexing unit, but this constraint can be eliminated if sequential processing by 
MPEG_PS pack unit is allowed. The result of this will be that the data for the next picture will be stored even in the 
multiplexing unit storing the last data of each picture, and the video bit rate can therefore be increased accordingly. 
[0496] Furthermore, because the PES_packet_length indicating the length of the video PES packet is set to 0 above, 

20 there is a problem that the PES_packet_length of the packet header after conversion to a pack cannot be determined 
until data writing to the pack completes. The PES_packet_length for each video PES packet in the SESF capsule can 
be written to the Tip packet. The PES_packet_ length can therefore be determined by sequential processing of TS 
packet units, and conversion can proceed even more quickly. 

[0497] Furthermore, the pack header (SCR) is described above as calculated during TS2PS conversion, but the 
25 pack header can be previously stored to the PES packet header stored in the MPEG_TS. For example, the pack header 
after TS2PS conversion could be stored to the PES packet header with a pack_header_field_f lag in the PES packet 
header set to 1 b. The data stored to the pack storing the pack header includes the data stored in packets from the TS 
packet to a TS packet determined by a specific rule (for example, a specific number to TS packets). 
[0498] When self-encoding externally input AV data to an MPEG transport stream format, the data recording appa- 
30 ratus and method of the invention described above can thus efficiently encode and decode the streams while main- 
taining decoder compatibility. 

[0499] Furthermore, because user private data can be stored to the streams recorded to the data recording medium, 
the added value of recorded content in the MPEG transport stream format can be increased. 

[0500] Moreover, because the stream is multiplexed in block units of 2 KB or less so that an MPEG_TS recorded to 
35 a data recording medium can be efficiently and easily converted to an MPEG_PS, the MPEG_TS can be very easily 
converted to an MPEG_PS without considering buffer management. 

[0501] As described above, according to the invention, flag information indicating that a first stream (an MPEG trans- 
port stream, for example) is recorded in a constrained formed enabling conversion to a second stream (an MPEG 
program stream, for example) is recorded in the management information. This males it possible to recognize, without 
40 analyzing the data recorded to the data recording medium, whether the recorded data has been recorded in the specified 
format, thereby improving the efficiency of the recognition process. 

[0502] Although the present invention has been described in connection with the preferred embodiments thereof 
with reference to the accompanying drawings, it is to be noted that various changes and modifications will be apparent 
to those skilled in the art. Such changes and modifications are to be understood as included within the scope of the 
45 present invention as defined by the appended claims, unless they depart therefrom. 



Claims 5. 

50 1. A data recording apparatus for encoding video data and audio data to a system steam and recording the system 
steam to a recording medium, a first type format (PS) and a second type format (TS) being allowed for the system 
stream, 

the apparatus comprising: 

55 a first encoding section that encodes video data and audio data in a predetermined method to generate video 

elementary stream and audio elementary stream, based on the second type format (TS); 
a second encoding section that performs system-encoding so that the video elementary stream and the audio 
elementary stream are multiplexed to generate the system stream, based on the second type format (TS); and 



35 



BNSDOCID: <EP 142271 OA 1J_> 



EP1 422 710 A1 



a controller that controls the first and second encoding sections; 

wherein a constrained format that enables conversion of the system stream from the second type format 
(TS) to the first type format (PS) is allowed for the second type encoding format (TS) : and 

the controller controls the first and second encoding sections so as to encode with the constrained format. 

The data recording apparatus according to claim 2, wherein, when the system stream in the second type format 
(TS) is converted to the system stream in the first type format (PS), it is unnecessary to re-encode the elementary 
stream. 

The data recording apparatus according to claim 1 , wherein, when the system stream in the second type format 
(TS) is converted to the system stream in the first type format (PS), it is unnecessary to change an order of mul- 
tiplexing of the video elementary stream and the audio elementary stream those composing the system stream. 

The data recording apparatus according to claim 2, wherein, several types of encoding methods can be allowed 
for the first type format (PS) and the second type format (TS), and the controller controls the first encoding section 
so as to encode the elementary stream with the encoding method which is allowed for both the first and second 
type formats. 

The data recording apparatus according to claim 3, wherein, 

the second type stream (TS) stores data segmented into packets and has a packet structure in which time 
stamp information indicating a relative transfer timing is added to each packet, 

the second type stream (PS) stores data segmented into packs and has a pack structure in which time stamp 
information indicating a relative transfer timing is added to each pack, the pack size is larger than the packet size, 
and 

the controller controls the second encoding section so that a predetermined number of packets are grouped 
and managed as a unit for multiplexing, and a total data size of the packets managed as a unit does not exceed 
a data size of one pack. 

A data recording method for encoding video data and audio data to a system steam and recording the system 
steam to a recording medium, a first type format (PS) and a second type format (TS) being allowed for the system 
stream, 

the method comprising: 

encoding video data and audio data in a predetermined method to generate video elementary stream and 
audio elementary stream, based on the second type format (TS); 

performing system -encoding so that the system stream is generated from the video elementary stream and 
the audio elementary stream, based on the second type format (TS); and 
controlling 'the encoding, and the multiplexing and system-encoding, 

wherein a constrained format that enables conversion of the system stream from the second type format 
(TS) to the first type format (PS) is allowed for the second type encoding format (TS), and the controlling controls 
the encoding and the performing so as to encode with the constrained format. 



.142271 OA 1_l_> 



36 



EP1 422 710 A1 




BNSDOCID: <EP 142271 OA 1_l_> 



37 



EP1 422 710 A1 




BNSDOCID: <EP 1 42271 0A1_I_> 



38 



EP1 422 710 A1 




BNSDOCID: <EP 1422710A1_I_> 



39 



EP1 422 710 A1 




40 

BNSDOCID: <EP 1422710A1_I_> 



EP1 422 710 A1 



Fig 5 A 
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Fig. 74 
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